S. PNEUMONIAE ANTIGENS
The present invention relates to isolated nucleic acid molecules, which encode antigens for Streptococcus
pneumoniae, which are suitable for use in preparation of pharmaceutical medicaments for the prevention
and treatment of bacterial infections caused by Streptococcus pneumoniae.

Streptococcus pneumoniae (Pneumococcus) is a lancet-shaped, gram-positive, facultative anaerobic
bacterium. It is only the encapsulated organism that is pathogenic for humans and experimental animals.
Capsules are antigenic and form the basis for classifying pneumococci by serotypes. Ninety serotypes
have been identified, based on their reaction with type-specific antisera. Most S. pneumoniae serotypes
have been shown to cause serious disease, and the ten most common serotypes are estimated to account
for about 62% of invasive disease worldwide. The ranking and serotype prevalence differ by age group
and geographic area.

Pneumococci are common inhabitants of the respiratory tract, and may be isolated from the nasopharynx
of 5% to 70% of normal adults. Rates of asymptomatic carriage vary with age, environment and the
presence of upper respiratory infections. Only 5%-10% of adults without children are carriers. In schools
and orphanages, 27% to 58% of students and residents may be carriers. On military installations, as
many as 50% to 60% of service personnel may be carriers. The duration of carriage varies and is generally
longer in children than adults (reviewed in Epidemiology and Prevention of Vaccine-Preventable
Diseases, 7th Edition-Second Printing, The Pink Book).

The relationship of carriage to the development of natural immunity is poorly understood. In addition,
the immunologic mechanism that allows disease to occur in a carrier is poorly understood.

Streptococcus pneumoniae is an important agent of human disease at the extremities of age and in those
who have underlying disease. Pneumococcal disease kills more people - in the US 40,000 or more each
year - than all other vaccine preventable diseases combined. The major clinical syndromes of
pneumococcal disease include pneumonia, bacteremia, and meningitis. The disease most often occurs
when a predisposing condition exists, particularly pulmonary disease. It is a common bacterial
complication of antecedent viral respiratory infection such as influenza and measles, and of chronic
conditions such as chronic obstructive pulmonary disease, diabetes, congestive heart failure, renal failure,
smoking and alcoholism. Pneumococcal infections are more common during the winter and in early
spring when respiratory diseases are more prevalent. Immunodeficiency (splenic dysfunction, iatrogenic,
etc.) is a risk factor for development of fatal pneumococcal infections, because of decreased bacterial
clearance and lack of antibodies. The incubation period is short, 1-3 days. Symptoms include an abrupt
onset of fever and shaking chills or rigor, productive cough, pleuritic chest pain, dyspnea, tachycardia
and hypoxia.

S. pneumoniae is responsible for 88% of bacteremia infections in the US. Pneumonia is the most common
form of invasive pneumococcal diseases: 150,000-570,000 cases per year (US). 36% of adult community-
acquired and 50% of hospital-acquired pneumonia is caused by S. pneumoniae (US). The incidence of
disease among adults aged 65 years and older has been reported to be ~60 cases/100,000. Case fatality
rates for this disease increase from 1.4% for those aged two or younger to as high as 20.6% among those
aged 80 or older. Diseases caused by influenza and Pneumococcus are together the fifth leading cause of
death for persons aged 65 and older. Mortality attributable to these pathogens is more than 90% in this
age group. Bacteremia occurs in about 25-30% of patients with pneumonia. The overall mortality rate of
bacteremia is about 20%, but may be as high as 60% in elderly people. In 1998, 51% of all deaths
attributable to invasive pneumococcal diseases occurred in the age group above 65 years. Pneumococci cause
13%-19% of all cases of bacterial meningitis in the United States. An estimated 3,000 to 6,000 cases of
pneumococcal meningitis occur each year. One-quarter of patients with pneumococcal meningitis also
have pneumonia. The clinical symptoms, spinal fluid profile and neurologic complications are similar to
other forms of purulent bacterial meningitis (reviewed in Epidemiology and Prevention of Vaccine-
Preventable Diseases, 7th Edition-Second Printing, The Pink Book).

In children, Pneumococci are a common cause of acute otitis media, and are detected in 28%-55% of
middle ear aspirates. By age 12 months, 62% of children have had at least one episode of acute otitis
media. Middle ear infections are the most frequent reasons for pediatric office visits in the United States,
resulting in over 20 million visits annually. Complications of pneumococcal otitis media may include
mastoiditis and meningitis. Bacteremia without a known site of infection is the most common invasive
clinical presentation among children <2 years of age, accounting for approximately 70% of invasive
disease in this age group. Bacteremic pneumonia accounts for 12%-16% of invasive pneumococcal disease
among children <2 years of age. With the decline of invasive Hib disease, S. pneumoniae has become the
leading cause of bacterial meningitis among children <5 years of age in the United States. Children <1
year have the highest rates of pneumococcal meningitis, approximately 10 cases per 100,000 population.
The burden of pneumococcal disease among children <5 years of age is significant. An estimated 17,000
cases of invasive disease occur each year, of which 13,000 are bacteremia without a known site of
infection and about 700 are meningitis. An estimated 200 children die every year as a result of invasive
pneumococcal disease. Although not considered invasive disease, an estimated 5 million cases of acute
otitis media occur each year among children <5 years of age (reviewed in Epidemiology and Prevention of
Vaccine-Preventable Diseases, 7th Edition-Second Printing, The Pink Book).

A definitive diagnosis of infection with Streptococcus pneumoniae generally relies on isolation of the
organism from blood or other normally sterile body sites. Tests are also available to detect capsular
polysaccharide antigen in body fluids.

Penicillin is the drug of choice for treatment. However, successful implementation of anti-infective
therapy has become increasingly difficult because of widespread antimicrobial resistance. Resistance to
penicillin is rising, and according to recent reports it reaches ~25% in the US {Whitney, C. et al., 2000}.
The proportion of macrolide-resistant strains reached ~20% {Hyde, T. et al., 2001}. Use of antimicrobial
agents is highly correlated with the increase in resistance of S. pneumoniae to β-lactams and macrolides
{McCormick, A. et al., 2003}.

However, even with effective antibiotic therapy (sensitive strains), the case fatality rate of invasive
disease is high with an average of 10% in the developed world and can be much higher with certain
serotypes, in elderly patients and in cases of bacteremia or meningitis (up to 80%).

Thus, there remains a need for an effective treatment to prevent or ameliorate pneumococcal infections.
A vaccine could not only prevent infections by streptococci, but more specifically prevent or ameliorate
colonization of host tissues (esp. in nasopharynx), thereby reducing the incidence of upper respiratory
infections and other suppurative infections, such as otitis media. Elimination of invasive diseases -
pneumonia, bacteremia and meningitis, and sepsis - would be a direct consequence of reducing the
incidence of acute infection and carriage of the organism. Vaccines capable of showing cross-protection
against the majority of S. pneumoniae strains causing human infections would also be useful to prevent or
ameliorate infections caused by all other streptococcal species, namely groups A, B, C and G.

A vaccine can contain a whole variety of different antigens. Examples of antigens are whole-killed or
attenuated organisms, subfractions of these organisms/tissues, proteins, or, in their most simple form,
peptides. Antigens can also be recognized by the immune system in the form of glycosylated proteins or
peptides and may also be or contain polysaccharides or lipids. Short peptides can be used since, for
example, cytotoxic T-cells (CTL) recognize antigens in the form of short peptides, usually 8-11 amino acids
long, in conjunction with major histocompatibility complex (MHC). B-cells can recognize linear
epitopes as short as 4-5 amino acids, as well as three-dimensional structures (conformational epitopes). In
order to obtain sustained, antigen-specific immune responses, adjuvants need to trigger immune
cascades that involve all cells of the immune system. Primarily, adjuvants are acting, but are not
restricted in their mode of action, on so-called antigen presenting cells (APCs). These cells usually first
encounter the antigen(s) followed by presentation of processed or unmodified antigen to immune effector
cells. Intermediate cell types may also be involved. Only effector cells with the appropriate specificity are
activated in a productive immune response. The adjuvant may also locally retain antigens and co-injected
other factors. In addition the adjuvant may act as a chemoattractant for other immune cells or may act
locally and/or systemically as a stimulating agent for the immune system.

Efforts to develop effective pneumococcal vaccines began as early as 1911. However, with the advent of
penicillin in the 1940s, interest in the vaccine declined, until it was observed that many patients still died
despite antibiotic treatment. By the late 60s, efforts were again being made to develop a polyvalent
vaccine. The first pneumococcal vaccines contained purified capsular polysaccharide antigen from 14
different types of pneumococcal bacteria. In 1983, a 23-valent polysaccharide vaccine (PPV23) was
licensed and replaced the 14-valent vaccine, which is no longer produced. PPV23 contains
polysaccharide antigen from 23 types of pneumococcal bacteria which cause 88% of bacteremic
pneumococcal disease. In addition, cross-reactivity occurs for several capsular types which account for an
additional 8% of bacteremic disease. Two polysaccharide vaccines are available in the United States
(Pneumovax 23, Merck, and Pnu-Immune 23, Wyeth-Lederle). Both vaccines contain 25 µg of each
antigen per dose and include either phenol or thimerosal as a preservative.

The first pneumococcal conjugate vaccine (PCV7, Prevnar) was licensed in the United States in 2000. It
includes purified capsular polysaccharide of 7 serotypes of S. pneumoniae (4, 9V, 14, 19F, 23F, 18C and 6B)
conjugated to a nontoxic variant of diphtheria toxin known as CRM197. The serotypes included in
Prevnar accounted for 86% of bacteremia, 83% of meningitis, and 65% of acute otitis media among
children <6 years of age in the United States during 1978-1994 (reviewed in Epidemiology and Prevention
of Vaccine-Preventable Diseases, 7th Edition-Second Printing, The Pink Book). Additional pneumococcal
polysaccharide conjugate vaccines containing 9 and 11 serotypes of S. pneumoniae are being developed.
The vaccine is administered intramuscularly. After 4 doses of Prevnar vaccine, virtually all healthy
infants develop antibody to all 7 serotypes contained in the vaccine. Prevnar has also been shown to be
immunogenic in infants and children, including those with sickle cell disease and HIV infection. In a
large clinical trial, Prevnar was shown to reduce invasive disease caused by vaccine serotypes, and
reduce invasive disease caused by all serotypes, including serotypes not in the vaccine. Children who
received Prevnar had fewer episodes of acute otitis media and underwent fewer tympanostomy tube
placements than unvaccinated children. The duration of protection following Prevnar is currently
unknown. Immunization with Prevnar reduces the rate of nasopharyngeal carriage of the vaccine
serotypes, while the overall carriage rate is unaffected. Unfortunately, it has also been shown to induce
serotype redistribution, that is, the replacement of vaccine serotypes by strains which are not covered by
Prevnar {Pelton, S. et al., 2003}.

Pneumococcal vaccine is recommended to be administered routinely to i., all children as part of the
routine childhood immunization schedule, ii., adults 65 years of age and older and iii., persons aged >2
years with normal immune systems who have chronic illnesses, including cardiovascular disease,
pulmonary disease, diabetes, alcoholism, cirrhosis, or cerebrospinal fluid leaks. In the elderly population
the target groups for pneumococcal vaccine and influenza vaccine overlap. These vaccines can be given at
the same time at different sites without increased side effects.

High mortality is observed among high-risk individuals (with underlying disease - mainly viral
respiratory infection, immunocompromise) even with effective antibiotic therapy. The mAb approach
targets patients with serious disease and provides immediate immune enhancement for the clearance of
the bacteria. Through opsonization bacteria are killed within phagocytic cells and not lysed in the blood
by antibiotics. This mechanism of action can help to eliminate the release of toxins (such as pneumolysin
and other cytotoxins), which worsen the clinical condition of septic patients. Recent advances in the
technology of monoclonal antibody production provide the means to generate human antibody reagents
and reintroduce antibody therapies, while avoiding the toxicities associated with serum therapy.
Immunoglobulins are an extremely versatile class of antimicrobial proteins that can be used to prevent
and treat emerging infectious diseases. Antibody therapy has been effective against a variety of diverse
microorganisms, reviewed in {Burnie, J. et al., 1998}.

Although capsular specific antibodies have been shown to be highly protective, it remains unclear what
concentration of these serotype-specific antibodies protects against disease, and more recently it has
become clear that opsonic activity and avidity of these antibodies are more critical determinants of
protection than concentration.

Protein conjugate vaccines are no doubt a great new addition to the armamentarium in the battle against
pneumococcal disease, but the vaccine contains a limited number of pneumococcal serotypes and given
adequate ecological pressure, replacement disease by non-vaccine serotypes remains a real threat,
particularly in areas with very high disease burden.

During the last decade the immunogenicity and protective capacity of several pneumococcal proteins
have been described in animal models and these are now being explored for the development of species-
common protein based vaccines. Such proteins are the Pneumococcal surface protein A (PspA,
{McDaniel, L. et al., 1991}; {Roche, H. et al., 2003}), Pneumococcal surface adhesin A (PsaA, {Talkington,
D. et al., 1996}), Choline binding protein A (CbpA, {Rosenow, C. et al., 1997}), LytB glucosaminidase,
LytC muramidase, PrtA serine protease, PhtA (histidine triad A) and Pneumococcal vaccine antigen A
(PvaA) {Wizemann, T. et al., 2001}; {Adamou, J. et al., 2001}.

Certain proteins or enzymes displayed on the surface of gram-positive organisms significantly contribute
to pathogenesis, and might be involved in the disease process caused by these pathogens. Often, these
proteins are involved in direct interactions with host tissues or in concealing the bacterial surface from the
host defense mechanisms {Navarre, W. et al., 1999}. S. pneumoniae is not an exception in this regard.
Several surface proteins are characterized as virulence factors, important for pneumococcal
pathogenicity, reviewed in {Jedrzejas, M., 2001}. If antibodies to these proteins could offer better
protection to humans, they could provide the source of a novel, protein-based pneumococcal vaccine to
be used in conjunction with or in place of the more traditional capsular polysaccharide vaccine. The use of
some of the above-described proteins as antigens for a potential vaccine, as well as a number of additional
candidates reviewed in {Di Guilmi, A. et al., 2002}, resulted mainly from a selection based on ease of
identification or chance of availability. There is a demand to identify relevant antigens for S. pneumoniae
in a more comprehensive way.

The present inventors have developed a method for identification, isolation and production of
hyperimmune serum reactive antigens from a specific pathogen, especially from Staphylococcus aureus
and Staphylococcus epidermidis (WO 02/059148). However, given the differences in biological property,
pathogenic function and genetic background, Streptococcus pneumoniae is distinct from Staphylococcus
strains. Importantly, the selection of sera for the identification of antigens from S. pneumoniae is different
from that applied to the S. aureus screens. Three major types of human sera were collected for that
purpose. First, healthy adults <45 years of age, preferably with small children in the household,
were tested for nasopharyngeal carriage of S. pneumoniae. A large percentage of young children are
carriers of S. pneumoniae, and they are considered to be a source of exposure for their family members.
Based on correlative data, protective (colonization neutralizing) antibodies are likely to be present in
exposed individuals (children with high carriage rate in the household) who are not carriers of S.
pneumoniae. To be able to select for relevant serum sources, a series of ELISAs measuring anti-S.
pneumoniae IgG and IgA antibody levels were performed with bacterial lysates and culture supernatant
proteins. Sera from high titer non-carriers were included in the genomic-based antigen identification.
This approach for selection of human sera is fundamentally different from that used for S. aureus, where
carriage or non-carriage state could not be associated with antibody levels. Second, serum samples from
convalescent phase patients with invasive pneumococcal diseases were characterized and selected in the
same way. The third group of sera, containing longitudinally collected samples, was also obtained from
individuals with invasive disease and was used mainly for validation purposes. The main value of this
collection is that one can follow the changes in antigen-specific antibody levels before disease (pre-), at the
time of onset (acute) and during recovery (convalescent). This latter group helps in the selection of
epitopes which induce antibodies during disease and are missing in the pre-disease state.

The genomes of the two bacterial species S. pneumoniae and S. aureus by themselves show a number of
important differences. The genome of S. pneumoniae contains approx. 2.16 Mb, while S. aureus harbours 2.85
Mb. They have an average GC content of 39.7% and 33%, respectively, and approximately 30 to 45% of the
encoded genes are not shared between the two pathogens. In addition, the two bacterial species require
different growth conditions and media for propagation. While S. pneumoniae is a strictly human
pathogen, S. aureus can also be found infecting a range of warm-blooded animals. A list of the most
important diseases which can be inflicted by the two pathogens is presented below. S. aureus causes
mainly nosocomial, opportunistic infections: impetigo, folliculitis, abscesses, boils, infected lacerations,
endocarditis, meningitis, septic arthritis, pneumonia, osteomyelitis, scalded skin syndrome (SSS), toxic
shock syndrome. S. pneumoniae causes mainly community-acquired infections: upper (pharyngitis, otitis
media) and lower respiratory infections (pneumonia), as well as bacteremia, sepsis and meningitis.
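
The GC content figures quoted above are simple base-composition percentages. As an illustration only, the following minimal Python sketch shows how such a percentage could be computed for any DNA string; the function name gc_content and the toy fragment are assumptions made for this example and are not taken from the sequences of the present invention.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among the unambiguous bases (A, C, G, T)."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Hypothetical 10-base fragment, used only to exercise the function.
    toy_fragment = "ATGCATTAGC"
    print(f"GC content: {gc_content(toy_fragment):.1f}%")
```

Applied to a complete genome record, the same calculation would be expected to give values close to those stated above, although ambiguous base codes are simply skipped in this sketch.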

The complete genome sequence of a capsular serotype 4 isolate of S. pneumoniae, designated TIGR4, was
determined by the random shotgun sequencing strategy (GenBank accession number AE005672; see
www.tigr.org/tigrscripts/CMR2/CMRHomePage.spl). This clinical isolate was taken from the blood of a
30-year-old male patient in Kongsvinger, Norway, and is highly invasive and virulent in a mouse model
of infection.

The problem underlying the present invention was to provide means for the development of
medicaments such as vaccines against S. pneumoniae infection. More particularly, the problem was to
provide an efficient, relevant and comprehensive set of nucleic acid molecules or hyperimmune serum
reactive antigens from S. pneumoniae that can be used for the manufacture of said medicaments.

Therefore, the present invention provides an isolated nucleic acid molecule encoding a hyperimmune
serum reactive antigen or a fragment thereof comprising a nucleic acid sequence, which is selected from
the group consisting of:

a) a nucleic acid molecule having at least 70% sequence identity to a nucleic acid molecule selected
from Seq ID No 1, 101-144,

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a),

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a)
or b),

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic
acid molecule of a), b), or c),

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the
nucleic acid molecule defined in a), b), c) or d).

According to a preferred embodiment of the present invention the sequence identity is at least 80%,
preferably at least 95%, especially 100%.
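
To make the identity thresholds above concrete, the following minimal Python sketch counts matching columns in two sequences that have already been aligned. It is an illustration under stated assumptions: the passage above does not fix the alignment method or the denominator, so the choice to divide by the number of aligned columns (gaps counted as mismatches), the function names percent_identity and meets_threshold, and the toy sequences are all hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Assumption: gaps are written as '-', columns where both sequences have a
    gap are ignored, and a gap opposite a base counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the aligned pair reaches the given identity threshold (in percent)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical aligned toy sequences: one substitution and one gap.
    reference = "ATGCCGTAAC"
    candidate = "ATGACGTA-C"
    print(f"identity: {percent_identity(reference, candidate):.1f}%")
    print("at least 70%:", meets_threshold(reference, candidate, 70.0))
    print("at least 95%:", meets_threshold(reference, candidate, 95.0))
```

Other conventions, such as dividing by the length of the shorter sequence, would give different percentages for the same pair, so the convention used needs to be stated whenever a threshold is applied.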

Furthermore, the present invention provides an isolated nucleic acid molecule encoding a hyperimmune
serum reactive antigen or a fragment thereof comprising a nucleic acid sequence selected from the group
consisting of

a) a nucleic acid molecule having at least 96% sequence identity to a nucleic acid molecule selected
from Seq ID No 2-6, 8, 10-16, 18-23, 25-31, 34, 36, 38-42, 44, 47-48, 51, 53, 55-62, 64, 67, 71-76, 78-
79, 81-94, 96-100,

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a),

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a)
or b),

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic
acid molecule of a), b) or c),

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the
nucleic acid defined in a), b), c) or d).
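
Items b) and c) of the lists above refer to a complementary molecule and to a stretch of at least 15 sequential bases. The following minimal Python sketch illustrates both notions for plain DNA strings. The function names, the decision to read "complementary" as the reverse complement, and the toy reference sequence are assumptions made for illustration; ambiguous base codes and RNA are not handled.

```python
# Translation table for the four unambiguous DNA bases (both cases).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA string."""
    return sequence.translate(COMPLEMENT)[::-1]


def sequential_fragments(sequence: str, length: int = 15) -> list[str]:
    """Return every window of `length` consecutive bases (empty if too short)."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


def shares_fragment(candidate: str, reference: str, length: int = 15) -> bool:
    """True if the candidate contains `length` sequential bases of the
    reference or of its reverse complement."""
    ref = reference.upper()
    windows = set(sequential_fragments(ref, length))
    windows |= set(sequential_fragments(reverse_complement(ref), length))
    return any(w in windows for w in sequential_fragments(candidate.upper(), length))


if __name__ == "__main__":
    # Hypothetical 24-base reference, not one of the Seq ID sequences.
    reference = "ATGGCTAGCTAGGATCCGATCGTA"
    candidate = "TT" + reference[3:18] + "AA"
    print(reverse_complement("ATGC"))
    print(shares_fragment(candidate, reference))
    print(shares_fragment(reverse_complement(candidate), reference))
```

An exact-match window test of this kind says nothing about hybridisation under stringent conditions, which depends on experimental parameters outside the scope of the sketch.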

According to another aspect the present invention provides an isolated nucleic acid molecule comprising
a nucleic acid sequence selected from the group consisting of

a) a nucleic acid molecule selected from Seq ID No 9, 17, 24, 32, 37, 43, 52, 54, 65-66, 70, 80,

b) a nucleic acid molecule which is complementary to the nucleic acid of a),

c) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the
nucleic acid defined in a), b), c) or d).

Preferably, the nucleic acid molecule is DNA or RNA.

According to a preferred embodiment of the present invention, the nucleic acid molecule is isolated from
a genomic DNA, especially from a S. pneumoniae genomic DNA.

According to the present invention a vector comprising a nucleic acid molecule according to the
present invention is provided.

In a preferred embodiment the vector is adapted for recombinant expression of the hyperimmune serum
reactive antigens or fragments thereof encoded by the nucleic acid molecule according to the present
invention.

The present invention also provides a host cell comprising the vector according to the present invention.

According to another aspect the present invention further provides a hyperimmune serum-reactive
antigen comprising an amino acid sequence being encoded by a nucleic acid molecule according to the
present invention.

In a preferred embodiment the amino acid sequence (polypeptide) is selected from the group consisting
of Seq ID No 145, 245-288.

In another preferred embodiment the amino acid sequence (polypeptide) is selected from the group
consisting of Seq ID No 146-150, 152, 154-160, 162-167, 169-175, 178, 180, 182-186, 188, 191-192, 195, 197,
199-206, 208, 211, 215-220, 222-223, 225-238, 240-244.

In a further preferred embodiment the amino acid sequence (polypeptide) is selected from the group
consisting of Seq ID No 153, 161, 168, 176, 181, 187, 196, 198, 209-210, 214, 224.

According to a further aspect the present invention provides fragments of hyperimmune serum-reactive
antigens selected from the group consisting of peptides comprising amino acid sequences of column
"predicted immunogenic aa" and "location of identified immunogenic region" of Table 1; the serum
reactive epitopes of Table 2, especially peptides comprising amino acids 4-11, 35-64, 66-76, 101-108, 111-
119 and 57-114 of Seq ID No 145; 5-27, 32-64, 92-102, 107-113, 119-125, 133-139, 148-162, 177-187, 195-201, 
207-214, 241-251, 254-269, 285-300, 302-309, 317-324, 332-357, 365-404, 411-425, 443-463, 470-477, 479-487, 
506-512, 515-520, 532-547, 556-596, 603-610, 616-622, 624-629, 636-642, 646-665, 667-674, 687-692, 708-720,
734-739, 752-757, 798-820, 824-851, 856-865 and 732-763 of Seq ID No 146; 14-21, 36-44, 49-66, 102-127, 
162-167, 177-196, 45-109 and 145-172 of Seq ID No 147; 17-35, 64-75, 81-92, 100-119, 125-172, 174-183, 214- 
222, 230-236, 273-282, 287-303, 310-315, 331-340, 392-398, 412-420, 480-505, 515-523, 525-546, 553-575, 592-
598, 603-609, 617-625, 631-639, 644-651, 658-670, 681-687, 691-704, 709-716, 731-736, 739-744, 750-763, 774- 
780, 784-791, 799-805, 809-822, 859-870, 880-885, 907-916, 924-941, 943-949, 973-986, 1010-1016, 1026-1036, 
1045-1054, 1057-1062, 1082-1088, 1095-1102, 1109-1120, 1127-1134, 1140-1146, 1152-1159, 1169-1179, 1187- 
1196, 1243-1251, 1262-1273, 1279-1292, 1306-1312, 1332-1343, 1348-1364, 1379-1390, 1412-1420, 1427-1436, 
1458-1468, 1483-1503, 1524-1549, 1574-1588, 1614-1619, 1672-1685, 1697-1707, 1711-1720, 1738-1753, 1781- 
1787, 1796-1801, 1826-1843, 132-478, 508-592 and 1753-1810 of Seq ID No 148; 15-43, 49-55, 71-77, 104-110,
123-130, 162-171, 180-192, 199-205, 219-227, 246-254, 264-270, 279-287, 293-308, 312-322, 330-342, 349-356, 
369-377, 384-394, 401-406, 416-422, 432-439, 450-460, 464-474, 482-494, 501-508, 521-529, 536-546, 553-558, 
568-574, 584-591, 602-612, 616-626, 634-646, 653-660, 673-681, 688-698, 705-710, 720-726, 736-749, 833-848,
1-199, 200-337, 418-494 and 549-647 of Seq ID No 149; 9-30, 65-96, 99-123, 170-178 and 1-128 of Seq ID No 
150; 7-32, 34-41, 96-106, 127-136, 154-163, 188-199, 207-238, 272-279, 306-312, 318-325, 341-347, 353-360, 
387-393, 399-406, 434-440, 452-503, 575-580, 589-601, 615-620, 635-640, 654-660, 674-680, 696-701, 710-731, 
1-548 and 660-691 of Seq ID No 151; 4-19, 35-44, 48-59, 93-99, 106-111, 130-138, 146-161 and 78-84 of
Seq ID No 152; 24-30, 36-43, 64-86, 93-99, 106-130, 132-145, 148-165, 171-177, 189-220, 230-249, 251-263,
293-300, 302-312, 323-329, 338-356, 369-379, 390-412 and 179-193 of Seq ID No 153; 30^9, 61-67, 74-81, 90- 
120, 123-145, 154-167, 169-179, 182-197, 200-206, 238-244, 267-272 and 230-265 of Seq ID No 154; 14-20, 49- 
65, 77-86 and 2-68 of Seq ID No 155; 4-9, 26-35, 42-48, 53-61, 63-85, 90-101, 105-111, 113-121, 129-137, 140- 
150, 179-188, 199-226, 228-237, 248-255, 259-285, 299-308, 314-331, 337-343, 353-364, 410-421, 436-442 and 
110-144 of Seq ID No 156; 36-47, 55-63, 94-108, 129-134, 144-158, 173-187, 196-206, 209-238, 251-266, 270- 
285, 290-295, 300-306, 333-344, 346-354, 366-397, 404-410, 422-435, 439-453, 466-473, 515-523, 529-543, 554-
569, 571-585, 590-596, 607-618, 627-643, 690-696, 704-714, 720-728, 741-749, 752-767, 780-799, 225-247 and 
480-507 of Seq ID No 157; 16-25, 36-70, 80-93, 100-106 and 78-130 of Seq ID No 158; 18-27, 41-46, 50-57, 
65-71, 79-85, 93-98, 113-128, 144-155, 166-178, 181-188, 201-207, 242-262, 265-273, 281-295, 303-309, 318-327 
and 36-64 of Seq ID No 159; 7-29, 31-44, 50-59, 91-96, 146-153, 194-201, 207-212, 232-238, 264-278, 284-290, 
296-302, 326-353, 360-370, 378-384, 400-405, 409-418, 420-435, 442-460, 499-506, 529-534, 556-562, 564-576, 
644-651, 677-684, 687-698, 736-743, 759-766, 778-784, 808-814, 852-858, 874-896, 920-925, 929-935, 957-965, 
1003-1012, 1021-1027, 1030-1044, 1081-1087, 1101-1111, 1116-1124, 1148-1159, 1188-1196, 1235-1251, 1288- 
1303, 1313-1319, 1328-1335, 1367-1373, 1431-1437, 1451-1458, 1479-1503, 1514-1521, 1530-1540, 1545-1552, 
1561-1568, 1598-1605, 1617-1647, 1658-1665, 1670-1676, 1679-1689, 1698-1704, 1707-1713, 1732-1738, 1744- 
1764, 1-70, 154-189, 922-941, 1445-1462 and 1483-1496 of Seq ID No 160; 6-51, 81-91, 104-113, 126-137, 150- 
159, 164-174, 197-209, 215-224, 229-235, 256-269, 276-282, 307-313, 317-348, 351-357, 376-397, 418-437, 454- 
464, 485-490, 498-509, 547-555, 574-586, 602-619 and 452-530 of Seq ID No 161; 25-31, 39-47, 49-56, 99-114, 
121-127, 159-186, 228-240, 253-269, 271-279, 303-315, 365-382, 395-405, 414-425, 438-453 and 289-384 of Seq 
ID No 162; 9-24, 41-47, 49-54, 68-78, 108-114, 117-122, 132-140, 164-169, 179-186, 193-199, 206-213, 244-251, 
267-274, 289-294, 309-314, 327-333, 209-249 and 286-336 of Seq ID No 163; 9-28, 53-67, 69-82, 87-93, 109- 
117, 1:^-177, 201-207, 220-227, 242-247, 262-268, 305-318, 320-325 and 286-306 of Seq ID No 164; 4-10, 26-
39, 47-58, 63-73, 86-96, 98-108, 115-123, 137-143, 148-155, 160-176, 184-189, 194-204, 235-240, 254-259, 272-
278 and 199-283 of Seq ID No 165; 4-26, 33-39, 47-53, 59-ffi, 76-83, 91-97, 104-112, 118-137, 155-160, 167- 
174, 198-207, 242-268, 273-279, 292-315, 320-332, 345-354, 358-367, 377-394, 403-410, 424-439, 445-451, 453- 
497, 511-518, 535-570, 573-589, 592-601, 604-610 and 202-242 of Seq ID No 166; 8-30, 36-45, 64-71, 76-82,
97-103, 105-112, 134-151, 161-183, 211-234, 253-268, 270-276, 278-284, 297-305, 309-315, 357-362, 366-372, 
375-384, 401-407, 409-416, 441-455, 463-470, 475-480, 490-497, 501-513, 524-537, 552-559, 565-576, 581-590, 
592-600, 619-625, 636-644, 646-656 and 316-419 of Seq ID No 167; 4-17, 52-58, 84-99, 102-110, 114-120, 124- 
135, 143-158, 160-173, 177-196, 201-216, 223-250, 259-267, 269-275 and 1-67 of Seq ID No 168; 6-46, 57-67, 
69-80, 82-133, 137-143, 147-168, 182-187, 203-209, 214-229, 233-242, 246-280 and 53-93 of Seq ID No 169; 7-
40, 50-56, 81-89, 117-123, 202-209, 213-218, 223-229, 248-261, 264-276, 281-288, 303-308, 313-324, 326-332,
340-346, 353-372, 434-443, 465-474, 514-523, 556^64, 605-616, 620-626, 631-636, 667-683, 685-699, 710-719, 
726-732, 751-756, 760-771, 779-788, 815-828, 855-867, 869-879, 897-902, 917-924, 926-931, 936-942, 981-1000, 
1006-1015, 1017-1028, 1030-1039, 1046-1054, 1060-1066, 1083-1092, 1099-1112, 1122-1130, 1132-1140, 1148- 
1158, 1161-1171, 1174-1181, 1209-1230, 1236-1244, 1248-1254, 1256-1267, 1269-1276, 1294-1299, 1316-1328,
1332-1354, 1359-1372, 1374-1380, 1384-1390, 1395-1408, 1419-1425, 1434-1446, 1453-1460, 1465-1471, 1474- 
1493, 1505-1515, 1523-1537, 1547-1555, 1560-1567, 1577-1605, 1633-1651, 1226-1309, 1455-1536 and 1538-
1605 of Seq ID No 170; 4-10, 31-39, 81-88, 106-112, 122-135, 152-158, 177-184, 191-197, 221-227, 230-246, 
249-255, 303-311, 317-326, 337-344, 346-362, 365-371, 430-437, 439-446, 453-462, 474-484 and 449-467 of Seq 
ID No 171; 9-15, 24-35, 47-55, 122-128, 160-177, 188-196, 202-208, 216-228, 250-261, 272-303, 318-324, 327- 
339, 346-352, 355-361, 368-373, 108-218 and 344-376 of Seq ID No 172; 6-14, 17-48, 55-63, 71-90, 99-109, 
116-124, 181-189, 212-223, 232-268, 270-294, 297-304, 319-325, 340-348, 351-370, 372-378, 388-394, 406-415, 
421-434 and 177-277 of Seq ID No 173; 21-39, 42-61, 65-75, 79-85, 108-115 and 11-38 of Seq ID No 174; 4- 
17, 26-39, 61-76, 103-113, 115-122, 136-142, 158-192, 197-203, 208-214, 225-230, 237-251 and 207-225 of Seq 
ID No 175; 5-11, 27-36, 42-53, 62-70, 74-93, 95-104, 114-119, 127-150, 153-159, 173-179, 184-193, 199-206, 
222-241, 248-253, 257-280, 289-295, 313-319, 322-342, 349-365, 368-389, 393-406, 408-413, 426-438, 447-461, 
463-470, 476-495, 532-537, 543-550 and 225-246 of Seq ID No 176; 4-29, 68-82, 123-130, 141-147, 149-157, 
178-191, 203-215, 269-277, 300-307, 327-335, 359-370, 374-380, 382-388, 393-400, 410-417, 434-442, 483-492, 
497-503, 505-513, 533-540, 564-569, 601-607, 639-647, 655-666, 693-706, 712-718, 726-736, 752-758, 763-771, 
774-780, 786-799, 806-812, 820-828, 852-863, 884-892, 901-909, 925-932, 943-948, 990-996, 1030-1036, 1051- 
1059, 1062-1068, 1079-1086, 1105-1113, 1152-1162, 1168-1179, 1183-1191, 1204-1210, 1234-1244, 1286-1295, 
1318-1326, 1396-1401, 1451-1460, 1465-1474, 1477-1483, 1488-1494, 1505-1510, 1514-1521, 1552-1565, 1593- 
1614, 1664-1672, 1677-1685, 1701-1711, 1734-1745, 1758-1770, 1784-1798, 1840-1847, 1852-1873, 1885-1891, 
1906-1911, 1931-1939, 1957-1970, 1977-1992, 2014-2020, 2026-2032, 2116-2134, 1-348, 373-490, 573-767, 903- 
1043, 1155-1198, 1243-1482, 1550-1595, 1682-1719, 1793-1921 and 2008-2110 of Seq ID No 177; 10-35, 39-52, 
107-112, 181-188, 226-236, 238-253, 258-268, 275-284, 296-310, 326-338, 345-368, 380-389, 391-408, 410-418, 
420-429, 444-456, 489-505, 573-588, 616-623, 637-643, 726-739, 741-767, 785-791, 793-803, 830-847, 867-881, 
886-922, 949-956, 961-980, 988-1004, 1009-1018, 1027-1042, 1051-1069, 1076-1089, 1108-1115, 1123-1135, 
1140-1151, 1164-1179, 1182-1191, 1210-1221, 1223-1234, 1242-1250, 1255-1267, 1281-1292, 1301-1307, 1315- 
1340, 1348-1355, 1366-1373, 1381-1413, 1417-1428, 1437-1444, 1453-1463, 1478-1484, 1490-1496, 1498-1503, 
1520-1536, 1538-1546, 1548-1570, 1593-1603, 1612-1625, 1635-1649, 1654-1660, 1670-1687, 1693-1700, 1705- 
1711, 1718-1726, 1729-1763, 1790-1813, 1871-1881, 1893-1900, 1907-1935, 1962-1970, 1992-2000, 2006-2013, 
2033-2039, 2045-2051, 2055-2067, 2070-2095, 2097-2110, 2115-2121, 2150-2171, 2174-2180, 2197-2202, 2206- 
2228 and 1526-1560 of Seq ID No 178; 4-17, 35-48, 54-76, 78-107, 109-115, 118-127, 134-140, 145-156, 169- 
174, 217-226, 232-240, 256-262, 267-273, 316-328, 340-346, 353-360, 402-409, 416-439, 448-456, 506-531, 540- 
546, 570-578, 586-593, 595-600, 62^^32, 662-667, 674-681, 689-705, 713-724, 730-740, 757-763, 773-778, 783- 
796, 829-835, 861-871, 888-899, 907-939, 941-955, 957-969, 986-1000, 1022-1028, 1036-1044, 1068-1084, 1095- 
1102, 1118-1124, 1140-1146, 1148-1154, 1168-1181, 1185-1190, 1197-1207, 1218-1226, 1250-1270, 1272-1281, 
1284-1296, 1312-1319, 1351-1358, 1383-1409, 1422-1428, 1438-1447, 1449-1461, 1482-1489, 1504-1510, 1518- 
1527, 1529-1537, 1544-1551, 1569-1575, 1622-1628, 1631-1637, 1682-1689, 1711-1718, 1733-1740, 1772-1783, 
1818-1834, 1859-1872, 1-64 and 128-495 of Seq ID No 179; 8-28, 32-37, 62-69, 119-125, 137-149, 159-164, 
173-189, 200-205, 221-229, 240-245, 258-265, 268-276, 287-293, 296-302, 323-329 and 1-95 of Seq ID No 180; 
9-18, 25-38, 49-63, 65-72, 74-81, 94-117, 131-137, 139-146, 149-158, 162-188, 191-207, 217-225, 237-252, 255- 
269, 281-293, 301-326, 332-342, 347-354, 363-370, 373-380, 391-400, 415-424, 441-447 and 75-107 of Seq ID
No 181; 4-24, 64-71, 81-87, 96-116, 121-128, 130-139, 148-155, 166-173, 176-184, 203-215, 231-238, 243-248, 
256-261, 280-286, 288-306, 314-329 and 67-148 of Seq ID No 182; 4-10, 19-37, 46-52, 62-81, 83-89, 115-120, 
134-139, 141-151, 168-186, 197-205, 209-234, 241-252, 322-335, 339-345, 363-379, 385-393, 403-431, 434-442, 
447-454, 459-465, 479-484, 487-496 and 404-420 of Seq ID No 183; 10-35, 46-66, 71-77, 84-93, 96-122, 138- 
148, 154-172, 182-213, 221-233, 245-263, 269-275, 295-301, 303-309, 311-320, 324-336, 340-348, 351-359, 375- 
381 and 111-198 of Seq ID No 184; 14-25, 30-42, 47-61, 67-75, 81-91, 98-106, 114-122, 124-135, 148-193, 209- 
227 and 198-213 of Seq ID No 185; 5-18, 45-50, 82-90, 97-114, 116-136, 153-161, 163-171, 212-219, 221-227, 
240-249, 267-281, 311-317, 328-337, 375-381, 390-395, 430-436, 449-455, 484-495, 538-543, 548-554, 556-564, 
580-586, 596-602 and 493-606 of Seq ID No 186; 9-25, 28-34, 37-44, 61-68, 75-81, 88-96, 98-111, 119-133, 138-
150, 152-163, 168-182, 186-194, 200-205, 216-223, 236-245, 257-264, 279-287, 293-304, 311-318, 325-330, 340- 
346, 353-358, 365-379, 399-409, 444-453 and 303-391 of Seq ID No 187; 16-36, 55-61, 66-76, 78-102, 121-130,
134-146, 150-212, 221-239, 255-276, 289-322, 329-357 and 29-59 of Seq ID No 188; 8-27, 68-74, 77-99, 110- 
116, 124-141, 171-177, 202-217, 221-228, 259-265, 275-290, 293-303, 309-325, 335-343, 345-351, 365-379, 384- 
394, 406-414, 423-437, 452-465, 478-507, 525-534, 554-560, 611-624, 628-651, 669-682, 742-7^, 767-778, 782- 
792, 804-812, 820-836, 79-231 and 359-451 of Seq ID No 189; 5-28, 39-45, 56-62, 67-74, 77-99, 110-117, 124-
141, 168-176, 200-230, 237-244, 268-279, 287-299, 304-326, 329-335, 348-362, 370-376, 379-384, 390-406, 420- 
429, 466-471, 479-489, 495-504, 529-541, 545-553, 561-577, 598-604, 622-630, 637-658, 672-680, 682-688, 690- 
696, 698-709, 712-719, 724-736, 738-746, 759-769, 780-786, 796-804, 813-818, 860-877, 895-904, 981-997, 1000- 
1014, 1021-1029, 1-162, 206-224, 254-350, 414-514 and 864-938 of Seq ID No 190; 4-11, 19-49, 56-66, 68-101,
109-116, 123-145, 156-165, 177-185, 204-221, 226-234, 242-248, 251-256, 259-265, 282-302, 307-330, 340-349, 
355-374, 377-383, 392-400, 422-428, 434-442, 462-474 and 266-322 of Seq ID No 191; 14-43, 45-57, 64-74, 80-
87, 106-127, 131-142, 145-161, 173-180, 182-188, 203-210, 213-219, 221-243, 245-254, 304-311, 314-320, 342- 
348, 354-365, 372-378, 394-399, 407-431, 436-448, 459-465, 470-477, 484-490, 504-509, 531-537, 590-596, 611- 
617, 642-647, 723-734, 740-751, 754-762, 764-774, 782-797, 807-812, 824-831, 838-845, 877-885, 892-898, 900-
906, 924-935, 940-946, 982-996, 1006-1016, 1033-1043, 1051-1056, 1058-1066, 1094-1108, 1119-1126, 1129- 
1140, 1150-1157, 1167-1174, 1176-1185, 1188-1201, 1209-1216, 1220-1228, 1231-1237, 1243-1248, 1253-1285, 
1288-1297, 1299-1307, 1316-1334, 1336-1343, 1350-1359, 1365-1381, 1390-1396, 1412-1420, 1427-1439, 1452- 
1459, 1477-1484, 1493-1512, 1554-1559, 1570-1578, 1603-1608, 1623-1630, 1654-1659, 1672-1680, 1689-1696, 
1705-1711, 1721-1738, 1752-1757, 1773-1780, 1817-1829, 1844-1851, 1856-1863, 1883-1895, 1950-1958, 1974- 
1990, 172-354, 384-448, 464-644, 648-728 and 1357-1370 of Seq ID No 192; 8-27, 68-74, 77-99, 110-116, 124- 
141, 169-176, 201-216, 220-227, 258-264, 274-289, 292-302, 308-324, 334-342, 344-350, 364-372, 377-387, 399-
407, 416-429, 445-458, 471-481, 483-500, 518-527, 547-553, 604-617, 621-644, 662-675, 767-778, 809-816, 15- 
307, 350-448 and 496-620 of Seq ID No 193; 4-17, 24-29, 53-59, 62-84, 109-126, 159-164, 189-204, 208-219, 
244-249, 274-290, 292-302, 308-324, 334-342, 344-350, 378-389, 391-397, 401-409, 424-432, 447-460, 470-479, 
490-504, 521-529, 538-544, 549-555, 570-577, 583-592, 602-608, 615-630, 635-647, 664-677, 692-698, 722-731, 
733-751, 782-790, 793-799, 56-267, 337-426 and 495-601 of Seq ID No 194; 12-22, 49-59, 77-89, 111-121, 136- 
148, 177-186, 207-213, 217-225, 227-253, 259-274, 296-302, 328-333, 343-354, 374-383, 424-446, 448-457, 468- 
480, 488-502, 507-522, 544-550, 553-560, 564-572, 587-596, 604-614, 619-625, 629-635, 638-656, 662-676, 680-
692, 697-713, 720-738, 779-786, 833-847, 861-869, 880-895, 897-902, 911-917, 946-951, 959-967, 984-990, 992- 
1004, 1021-1040, 1057-1067, 1073-1080 and 381-403 of Seq ID No 195; 4-10, 26-31, 46-56, 60-66, 70-79, 86-94, 
96-102, 109-118, 132-152, 164-187, 193-206, 217-224 and 81-149 of Seq ID No 196; 4-21, 26-37, 48-60, 71-82, 
109-117, 120-128, 130-136, 142-147, 181-187, 203-211, 216-223, 247-255, 257-284, 316-325, 373-379, 395-400, 
423-435, 448-456, 479-489, 512-576, 596-625, 641-678, 680-688, 692-715 and 346-453 of Seq ID No 197; 10-16, 
25-31, 34-56, 58-69, 71-89, 94-110, 133-176, 186-193, 208-225, 240-250, 259-266, 302-307, 335-341, 376-383, 
410-416 and 316-407 of Seq ID No 198; 11-29, 42-56, 60-75, 82-88, 95-110, 116-126, 132-143, 145-160, 166- 
172, 184-216 and 123-164 of Seq ID No 199; 11-29, 54-63, 110-117, 139-152, 158-166, 172-180, 186-193, 215-
236, 240-251, 302-323, 330-335, 340-347, 350-366, 374-381 and 252-299 of Seq ID No 200; 18-27, 35-42, 50-56, 
67-74, 112-136, 141-153, 163-171, 176-189, 205-213, 225-234, 241-247, 253-258, 269-281, 288-298, 306-324, 
326-334, 355-369, 380-387 and 289-320 of Seq ID No 201; 7-15, 19-41, 56-72, 91-112, 114-122, 139-147, 163- 
183, 196-209, 258-280, 326-338, 357-363, 391-403, 406-416 and 360-378 of Seq ID No 202; 11-18, 29-41, 43-49, 
95-108, 142-194, 204-212, 216-242, 247-256, 264-273 and 136-149 of Seq ID No 203; 18-24, 33-40, 65-79, 89- 
102, 113-119, 130-137, 155-161, 173-179, 183-203, 205-219, 223-231, 245-261, 267-274, 296-306, 311-321, 330- 
341, 344-363, 369-381, 401-408, 415-427, 437-444, 453-464, 472-478, 484-508, 517-524, 526-532, 543-548 and 
59-180 of Seq ID No 204; 5-13, 52-65, 67-73, 97-110, 112-119, 134-155 and 45-177 of Seq ID No 205; 6-28, 
34-43, 57-67, 75-81, 111-128, 132-147, 155-163, 165-176, 184-194, 208-216, 218-229, 239-252, 271-278, 328-334,
363-376, 381-388, 426-473, 481-488, 492-498, 507-513, 536-546, 564-582, 590-601, 607-623, 148-269, 420-450
and 610-648 of Seq ID No 206; 4-12, 20-38, 69-75, 83-88, 123-128, 145-152, 154-161, 183-188, 200-213, 245- 
250, 266-272, 306-312, 332-339, 357-369, 383-389, 395-402, 437-453, 455-470, 497-503 and 1-112 of Seq ID No
207; 35-59, 74-86, 111-117, 122-137 and 70-154 of Seq ID No 208; 26-42, 54-61, 65-75, 101-107, 123-130, 137- 
144, 148-156, 164-172, 177-192, 213-221, 231-258 and 157-249 of Seq ID No 209; 29-38, 61-67, 77-87, 94-100, 
105-111, 118-158 and 1-97 of Seq ID No 210; 7-21, 30-48, 51-58, 60-85, 94-123, 134-156, 160-167, 169-183, 
186-191, 216-229, 237-251, 257-267, 272-282, 287-298 and 220-243 of Seq ID No 211; 6-29, 34-47, 56-65, 69- 
76, 83-90, 123-134, 143-151, 158-178, 197-203, 217-235, 243-263, 303-309, 320-333, 338-348, 367-373, 387-393, 
407-414, 416-427, 441-457, 473-482, 487-499, 501-509, 514-520, 530-535, 577-583, 590-602, 605-612, 622-629,
641-670, 678-690, 37-71 and 238-307 of Seq ID No 212; 7-40, 121-132, 148-161, 196-202, 209-215, 221-235, 
248-255, 271-280, 288-295, 330^9, 395-409, 414-420, 446-451, 475-487, 556^, 568-575, 580-586, 588-595, 
633-638, 643-648, 652-659, 672-685, 695-700, 710-716, !;^7-742, 749-754, 761-767, 775-781, 796-806, 823-835,
850-863, 884-890, 892-900, 902-915, 934-941 and 406-521 of Seq ID No 213; 9-18, 24-46, 51-58, 67-77, 85-108, 
114-126, 129-137, 139-146, 152-165, 173-182, 188-195, 197-204, 217-250, 260-274, 296-313, 343-366, 368-384, 
427-434, 437-446, 449-455, 478-484, 492-506, 522-527, 562-591, 599-606, 609-618, 625-631, 645-652 and 577-
654 of Seq ID No 214; 13-20, 26-37, 41-53, 56-65, 81-100, 102-114, 118-127, 163-188, 196-202, 231-238, 245- 
252, 266-285, 293-298, 301-306 and 19-78 of Seq ID No 215; 10-23, 32-42, 54-66, 73-91, 106-113, 118-127, 
139-152, 164-173, 198-207, 210-245, 284-300, 313-318, 330-337, 339-346, 354-361, 387-393, 404-426, 429-439, 
441-453, 467-473, 479-485, 496-509, 536-544, 551-558, 560-566, 569-574, 578-588, 610-615, 627-635, 649-675, 
679-690, 698-716, 722-734, 743-754, 769-780, 782-787 and 480-550 of Seq ID No 216; 6-39, 42-50, 60-68, 76-
83, 114-129, 147-162, 170-189, 197-205, 217-231, 239-248, 299-305, 338-344, 352-357, 371-377, 380-451, 459- 
483, 491-499, 507-523, 537-559, 587-613, 625-681, 689-729, 737-781, 785-809, 817-865, 873-881, 889-939, 951-
975, 983-1027, 1031-1055, 1063-1071, 1079-1099, 1103-1127, 1151-1185, 1197-1261, 1269-1309, 1317-1333, 
1341-1349, 1357-1465, 1469-1513, 1517-1553, 1557-1629, 1637-1669, 1677-1701, 1709-1725, 1733-1795, 1823- 
1849, 1861-1925, 1933-1973, 1981-2025, 2029-2053, 2061-2109, 2117-2125, 2133-2183, 2195-2219, 2227-2271, 
2275-2299, 2307-2315, 2323-2343, 2347-2371, 2395-2429, 2441-2529, 2537-2569, 2577-2601, 2609-2625, 2633- 
2695, 2699-2737, 2765-2791, 2803-2867, 2889-2913, 2921-2937, 2945-2969, 2977-2985, 2993-3009, 3023-3045, 
3073-3099, 3111-3167, 3175-3215, 3223-3267, 3271-3295, 3303-3351, 3359-3367, 3375-3425, 3437-3461, 3469- 
3513, 3517-3541, 3549-3557, 3565-3585, 3589-3613, 3637-3671, 3683-3747, 3755-3795, 3803-3819, 3827-3835, 
3843-3951, 3955-3999, 4003-4039, 4043-4115, 4123-4143, 4147-4171, 4195-4229, 4241-4305, 4313-4353, 4361-
4377, 4385-4393, 4401-4609, 4513-4557, 4561-4597, 4601-4718, 4749-4768, 74-171, 452-559 and 2951-3061 of 
Seq ID No 217; 16-22, 30-51, 70-111, 117-130, 137-150, 171-178, 180-188, 191-196 and 148-181 of Seq ID No 
218; 6-19, 21-46, 50-56, 80-86, 118-126, 167-186, 189-205, 211-242, 244-267, 273-286, 290-297, 307-316, 320- 
341 and 34-60 of Seq ID No 219; 5-26, 33-43, 48-54, 58-63, 78-83, 113-120, 122-128, 143-152, 157-175, 185- 
192, 211-225, 227-234, 244-256, 270-281, 284-290, 304-310, 330-337, 348-355, 362-379, 384-394, 429-445, 450- 
474, 483-490, 511-520, 537-546, 548-554, 561-586, 590-604, 613-629, 149-186, 285-431 and 573-659 of Seq ID 
No 220; 5-26, 49-59, 61-67, 83-91, 102-111, 145-157, 185-192, 267-272, 279-286, 292-298, 306-312, 134-220, 
235-251 and 254-280 of Seq ID No 221; 5-19, 72-79, 83-92, 119-124, 140-145, 160-165, 167-182, 224-232, 240- 
252, 259-270, 301-310, 313-322, 332-343, 347-367, 384-398, 416-429, 431-446, 454-461 and 1-169 of Seq ID No 
222; 8-17, 26-31, 56-62, 75-83, 93-103, 125-131, 135-141, 150-194, 205-217, 233-258, 262-268, 281-286 and 127- 
168 of Seq ID No 223; 6-12, 69-75, 108-115, 139-159, 176-182, 194-214 and 46-161 of Seq ID No 224; 6-13, 
18-27, 39-48, 51-59, 66-73, 79-85, 95-101, 109-116, 118-124, 144-164, 166-177, 183-193, 197-204, 215-223, 227- 
236, 242-249, 252-259, 261-270, 289-301, 318-325 and 12-58 of Seq ID No 225; 4-10, 26-32, 48-60, 97-105, 
117-132, 138-163, 169-185, 192-214, 219-231, 249-261, 264-270, 292-308, 343-356, 385-392, 398-404, 408-417, 
435-441 and 24-50 of Seq ID No 226; 10-40, 42-48, 51-61, 119-126 and 1-118 of Seq ID No 227; 5-17, 40-58, 
71-83, 103-111, 123-140, 167-177, 188-204 and 116-128 of Seq ID No 228; 4-9, 11-50, 57-70, 112-123, 127-138 
and 64-107 of Seq ID No 229; 9-39, 51-67 and 1-101 of Seq ID No 230; 5-14, 17-25, 28-46, 52-59, 85-93, 99- 
104, 111-120, 122-131, 140-148, 158-179, 187-197, 204-225, 271-283, 285-293 and 139-155 of Seq ID No 231; 
42-70, 73-90, 92-108, 112-127, 152-164, 166-172, 181-199, 201-210, 219-228, 247-274, 295-302, 322-334, 336- 
346, 353-358, 396-414, 419-425, 432-438, 462-471, 518-523, 531-536, 561-567, 576-589, 594-612, 620-631, 665- 
671, 697-710, 718-731, 736-756, 765-771, 784-801 and 626-653 of Seq ID No 232; 8-28, 41-51, 53-62, 68-74, 
79-85, 94-100, 102-108, 114-120, 130-154, 156-162, 175-180, 198-204, 206-213, 281-294, 308-318, 321-339, 362- 
368, 381-386, 393-399, 407-415 and 2-13 of Seq ID No 233; 4-39, 48-65, 93-98, 106-112, 116-129 and 10-36 of 
Seq ID No 234; 25-32, 35-50, 66-71, 75-86, 90-96, 123-136, 141-151, 160-179, 190-196, 209-215, 222-228, 235- 
242, 257-263, 270-280 and 209-247 of Seq ID No 235; 5-29, 31-38, 50-57, 62-75, 83-110, 115-132, 168-195, 
197-206, 216-242, 249-258, 262-269, 333-340, 342-350, 363-368, 376-392, 400-406, 410-421, 423-430, 436-442, 
448-454, 460-466, 471-476, 491-496, 511-516, 531-536, 551-556, 571-576, 585-591, 599-605, 27-70, 219-293,
441-504 and 512-584 of Seq ID No 236; 4-12, 14-34, 47-75, 83-104, 107-115, 133-140, 148-185, 187-196, 207- 
212, 224-256, 258-265, 281-287, 289-296, 298-308, 325-333, 345-355, 365-371, 382-395, 424-435, 441-457, 465- 
472, 483-491, 493-505, 528-534, 536-546, 552-558, 575-584, 589-600, 616-623 and 576-591 of Seq ID No 237; 
4-76, 78-89, 91-126, 142-148, 151-191, 195-208, 211-223, 226-240, 256-277, 279-285, 290-314, 317-323, 
358-377, 381-387, 391-396, 398-411, 415-434, 436-446, 454-484, 494-512, 516-523, 538-552, 559-566, 571-
577, 579-596, 599-615, 620-627, 635-644, 694-707, 720-734, 737-759, 761-771 and 313-329 of Seq ID No 
238; 7-38, 44-49, 79-89, 99-108, 117-123, 125-132, 137-146, 178-187, 207-237, 245-255, 322-337, 365-387, 398-
408, 44S462, 603-608, 623-628, 644-650, 657-671, 673-679 and 111-566 of Seq ID No 239; 6-20, 22-35, 39-45,
58-64, 77-117, 137-144, 158-163, 205-210, 218-224, 229-236, 239-251, 26^-277, 299-307, 323-334, 353-384, 388-
396, 399-438, 443-448, 458-463, 467-478, 481-495, 503-509, 511-526, 559-576, 595-600, 612-645, 711-721, 723-
738, 744-758, 778-807 and 686-720 of Seq ID No 240; 10-33, 35-41, 72-84, 129-138, 158-163, 203-226, 243-252,
258-264, 279-302, 322-329, 381-386, 401-406, 414-435 and 184-385 of Seq ID No 241; 4-9, 19-24, 41-47, 75-85, 
105-110, 113-146 and 45-62 of Seq ID No 242; 4-25, 52-67, 117-124, 131-146, 173-180, 182-191, 195-206, 215-
221, 229-236, 245-252, 258-279, 286-291, 293-302, 314-320, 327-336, 341-353, 355-361, 383-389 and 1-285 of 
Seq ID No 243; 14-32, 38-50, 73-84, 93-105, 109-114 and 40-70 of Seq ID No 244; 5-26 and 22-34 of Seq ID 
No 245; 23-28 and 13-39 of Seq ID No 246; 8-14 and 21-34 of Seq ID No 247; 4-13, 20-29, 44-50, 59-74 and 
41-69 of Seq ID No 248; 4-9, 19-42, 48-59, 71-83 and 57-91 of Seq ID No 249; 4-14 and 10-28 of Seq ID 
No 250; 22-28, 32-42, 63-71, 81-111, 149-156, 158-167, 172-180, 182-203, 219-229 and 27-49 of Seq ID No
251; 17-27 and 23-32 of Seq ID No 252; 18-24 and 28-38 of Seq ID No 253; 9-15 and 13-27 of Seq ID No 
254; 13-22 and 18-29 of Seq ID No 255; 17-26 and 2-11 of Seq ID No 256; 4-33 and 16-32 of Seq ID No 257; 
4-10, 37-43, 54-84, 92-127 and 15-62 of Seq ID No 258; 4-14, 20-32, 35-60, 69-75, 79-99, 101-109, 116-140 and 
124-136 of Seq ID No 259; 2-13 of Seq ID No 260; 4-13, 28-42 and 42-57 of Seq ID No 261; 4-14, 27-44 and 
14-35 of Seq ID No 262; 4-12 and 1-27 of Seq ID No 263; 4-18, 39-45, 47-74 and 35-66 of Seq ID No 264; 8-
20, 43-77 and 17-36 of Seq ID No 265; 4-30, 35-45, 51-57 and 35-49 of Seq ID No 266; 4-24, 49-57 and 15-34
of Seq ID No 267; 4-22 and 8-27 of Seq ID No 268; 13-25, 32-59, 66-80 and 21-55 of Seq ID No 269; 4-10, 
24-33, 35-42, 54-65, 72-^2, 98-108 and 15-30 of Seq ID No 270; 8-19 and 17-47 of Seq ID No 271; 12-18, 40-
46 and 31-52 of Seq ID No 272; 4-20, 35-78, 83-102, 109-122 and 74-86 of Seq ID No 273; 7-17, 21-41, 46-63 
and 2-20 of Seq ID No 274; 30-37 and 2-33 of Seq ID No 275; 4-13, 17-25 and 1-15 of Seq ID No 276; 17-
31, 44-51 and 20-51 of Seq ID No 277; 20-30 and 5-23 of Seq ID No 278; 13-33, 48-71 and 92-110 of Seq ID
No 279; 4-9, 50-69, 76-88, 96-106, 113-118 and 12-34 of Seq ID No 280; 4-24 and 6-26 of Seq ID No 281; 7- 
26 and 14-30 of Seq ID No 282; 9-39, 46-68, 75-82, 84-103 and 26-44 of Seq ID No 283; 4-30, 33-107 and 58- 
84 of Seq ID No 284; 4-12 and 9-51 of Seq ID No 285; 12-18, 29-37 and 6-37 of Seq ID No 286; 4-21, 33-52, 
64-71 and 16-37 of Seq ID No 287; 9-19 and 2-30 of Seq ID No 288; 20-37 of Seq ID No 245; 8-27 of Seq 
ID No 246; 10-27 of Seq ID No 247; 42-59 and 52-69 of Seq ID No 248; 63-80 and 74-91 of Seq ID No 249; 
11-28 of Seq ID No 250; 28-4© of Seq ID No 251; 15-32 of Seq ID No 252; 4-20 of Seq ID No 253; 10-27 of 
Seq ID No 254; 17-34 of Seq ID No 255; 1-18 of Seq ID No 256; 16-33 of Seq ID No 257; 16-36, 30-49 and
43-62 of Seq ID No 258; 122-139 of Seq ID No 259; 1-18 of Seq ID No 260; 41-58 of Seq ID No 261; 15-35 
of Seq ID No 262; 2-27 of Seq ID No 263; 18-36 of Seq ID No 265; 34-51 of Seq ID No 266; 9-27 of Seq ID 
No 268; 22-47 of Seq ID No 269; 18-36 and 29-47 of Seq ID No 271; 32-52 of Seq ID No 272; 72-89 of Seq
ID No 273; 3-20 of Seq ID No 274; 3-21 and 15-33 of Seq ID No 275; 1-18 of Seq ID No 276; 6-23 of Seq 
ID No 278; 93-110 of Seq ID No 279; 13-34 of Seq ID No 280; 7-26 and 9-26 of Seq ID No 281; 16-33 of 
Seq ID No 282; 27-44 of Seq ID No 283; 67-84 of Seq ID No 284; 10-33 and 26-50 of Seq ID No 285; 7-25 
and 19-37 of Seq ID No 286; 17-37 of Seq ID No 287; 3-20 and 13-30 of Seq ID No 288; 62-80 and 75-93 of 
Seq ID No 145; 92-108 of Seq ID No 147; 332-349, 177-200 and 1755-1777 of Seq ID No 148; 109-133, 149- 
174, 260-285 and 460-485 of Seq ID No 149; 26-47 and 42-64 of Seq ID No 150; 22-41, 35-54, 115-130, 306-
325, 401-420 and 454-478 of Seq ID No 151; 22-45 of Seq ID No 155; 156-174, 924-940, 1485-1496, 1447- 
1462 and 1483-1498 of Seq ID No 160; 457-475 of Seq ID No 161; 302-325 of Seq ID No 163; 288-305 of
Seq ID No 164; 244-266 and 260-282 of Seq ID No 165; 204-225 and 220-241 of Seq ID No 166; 324-345, 
340-361, 356-377, 372-393 and 388-408 of Seq ID No 167; 39-64 of Seq ID No 168; 54-76 and 70-92 of Seq 
ID No 169; 1227-1247, 1539-1559, 1554-1574, 1569-1589, 1584-1604, 1242-1262, 1272-1292, 1287-1308, 1456-
1477, 1472-1494, 1488-1510 and 1505-1526 of Seq ID No 170; 351-368 of Seq ID No 172; 179-200, 195-216,
211-232, 227-248 and 243-263 of Seq ID No 173; 13-37 of Seq ID No 174; 208-224 of Seq ID No 175; 42-64,
59-81, 304-328, 323-348, 465-489, 968-992, 1399-1418, 1412-1431 and 2092-2111 of Seq ID No 177; 1528-1547
and 1541-1560 of Seq ID No 178; 184-200, 367-388, 382-403, 409-429, 425-444 and 438-457 of Seq ID No 
179; 27-50 and 45-67 of Seq ID No 180; 114-131 and 405-419 of Seq ID No 183; 113-134, 129-150, 145-166,
161-182 and 177-198 of Seq ID No 184; 495-515 of Seq ID No 186; 346-358 of Seq ID No 187; 208-224 of 
Seq ID No 190; 178-194, 202-223, 217-238, 288-308 and 1355-1372 of Seq ID No 192; 57-78 of Seq ID No
194; 347-369, 364-386, 381-403, 398-420, 415-437 and 432-452 of Seq ID No 197; 347-372 of Seq ID No 198; 



WO 2004/092209 PCT/EP2004/003984

147-163 of Seq ID No 199; 263-288 of Seq ID No 200; 361-377 of Seq ID No 202; 82-104, 99-121, 116-138, 
133-155 and 150-171 of Seq ID No 204; 110-130 and 125-145 of Seq ID No 205; 613-631, 626-644 and 196- 
213 of Seq ID No 206; 78-100, 95-117, 112-134 and 129-151 of Seq ID No 208; 158-180, 175-197, 192-214, 
209-231 and 226-248 of Seq ID No 209; 30-50, 45-65 and 60-79 of Seq ID No 210; 431-455 and 450-474 of 
Seq ID No 213; 579-601, 596-618, 613-635 and 630-653 of Seq ID No 214; 920-927, 98-119, 114-135, 130-151, 
146-167 and 162-182 of Seq ID No 217; 36-59 of Seq ID No 219; 194-216 and 381-404 of Seq ID No 220; 
236-251 and 255-279 of Seq ID No 221; 80-100 and 141-164 of Seq ID No 222; 128-154 of Seq ID No 223;
82-100, 95-116 and 111-134 of Seq ID No 224; 55-76, 71-92 and 87-110 of Seq ID No 227; 91-106 of Seq ID 
No 229; 74-96 of Seq ID No 230; 140-157 of Seq ID No 231; 4-13 of Seq ID No 233; 41-65 and 499-523 of 
Seq ID No 236; 122-146, 191-215, 288-313, 445-469 and 511-535 of Seq ID No 239; 347-368 of Seq ID No 
241; 46-61 of Seq ID No 242; 15-37, 32-57, 101-121, 115-135, 138-158, 152-172, 220-242 and 236-258 of Seq
ID No 243. 
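
For readers who wish to handle the above list of amino acid ranges in machine-readable form, the following is a minimal sketch in Python of how such a list might be tabulated. It is an illustration only and not part of the disclosed method. The function name `parse_epitope_list` and the returned layout are assumptions made here; the only input format assumed is the one used above, namely comma-separated ranges followed by "of Seq ID No N", with groups separated by semicolons.

```python
import re
from collections import defaultdict


def parse_epitope_list(text):
    """Return {seq_id: [(start, end), ...]} for a list written in the
    style "4-10, 26-39 and 199-283 of Seq ID No 165; 2-13 of Seq ID No 260".

    Hypothetical helper for illustration; not taken from the specification.
    """
    regions = defaultdict(list)
    # Collapse line wrapping, then rejoin ranges split as "26-" / "39".
    flat = re.sub(r"-\s+", "-", " ".join(text.split()))
    for group in flat.split(";"):
        match = re.search(r"^(.*)\bof Seq ID No (\d+)", group.strip())
        if match is None:
            continue  # fragment without a sequence identifier
        seq_id = int(match.group(2))
        # Only well-formed "start-end" tokens are kept; tokens that are
        # illegible in the source text are skipped rather than guessed.
        for start, end in re.findall(r"\b(\d+)-(\d+)\b", match.group(1)):
            regions[seq_id].append((int(start), int(end)))
    return dict(regions)


sample = "4-10, 26-\n39 and 199-283 of Seq ID No 165; 2-13 of Seq ID No 260"
table = parse_epitope_list(sample)
```

Where a sequence identifier occurs more than once in the list, as several do above, the ranges from each occurrence are accumulated under the same key.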

The present invention also provides a process for producing a S. pneumoniae hyperimmune serum
reactive antigen or a fragment thereof according to the present invention comprising expressing one or
more of the nucleic acid molecules according to the present invention in a suitable expression system.

Moreover, the present invention provides a process for producing a cell, which expresses a S. pneumoniae
hyperimmune serum reactive antigen or a fragment thereof according to the present invention
comprising transforming or transfecting a suitable host cell with the vector according to the present
invention.

According to the present invention a pharmaceutical composition, especially a vaccine, comprising a
hyperimmune serum-reactive antigen or a fragment thereof as defined in the present invention or a
nucleic acid molecule as defined in the present invention is provided.

In a preferred embodiment the pharmaceutical composition further comprises an immunostimulatory
substance, preferably selected from the group comprising polycationic polymers, especially polycationic
peptides, immunostimulatory deoxynucleotides (ODNs), peptides containing at least two LysLeuLys
motifs, especially KLKLLLLLKLK, neuroactive compounds, especially human growth hormone, alum,
Freund's complete or incomplete adjuvants or combinations thereof.

In a more preferred embodiment the immunostimulatory substance is a combination of either a
polycationic polymer and immunostimulatory deoxynucleotides or of a peptide containing at least two
LysLeuLys motifs and immunostimulatory deoxynucleotides.

In a still more preferred embodiment the polycationic polymer is a polycationic peptide, especially
polyarginine.

According to the present invention the use of a nucleic acid molecule according to the present invention
or a hyperimmune serum-reactive antigen or fragment thereof according to the present invention for the
manufacture of a pharmaceutical preparation, especially for the manufacture of a vaccine against S.
pneumoniae infection, is provided.

Also an antibody, or at least an effective part thereof, which binds at least to a selective part of the
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention, is
provided herewith.

In a preferred embodiment the antibody is a monoclonal antibody.

In another preferred embodiment the effective part of the antibody comprises Fab fragments. 

In a further preferred embodiment the antibody is a chimeric antibody.

In a still preferred embodiment the antibody is a humanized antibody.

The present invention also provides a hybridoma cell line, which produces an antibody according to the 
present invention. 

Moreover, the present invention provides a method for producing an antibody according to the present
invention, characterized by the following steps:

• initiating an immune response in a non-human animal by administering a hyperimmune
serum-reactive antigen or a fragment thereof, as defined in the invention, to said animal,

• removing an antibody containing body fluid from said animal, and

• producing the antibody by subjecting said antibody containing body fluid to further
purification steps.

Accordingly, the present invention also provides a method for producing an antibody according to the
present invention, characterized by the following steps:

• initiating an immune response in a non-human animal by administering a hyperimmune
serum-reactive antigen or a fragment thereof, as defined in the present invention, to said animal,

• removing the spleen or spleen cells from said animal,

• producing hybridoma cells of said spleen or spleen cells,

• selecting and cloning hybridoma cells specific for said hyperimmune serum-reactive antigens or a
fragment thereof,

• producing the antibody by cultivation of said cloned hybridoma cells and optionally further
purification steps.

The antibodies provided or produced according to the above methods may be used for the preparation of 
a medicament for treating or preventing S. pneumoniae infections. 

According to another aspect the present invention provides an antagonist, which binds to a
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention.

Such an antagonist capable of binding to a hyperimmune serum-reactive antigen or fragment thereof 
according to the present invention may be identified by a method comprising the following steps: 

a) contacting an isolated or immobilized hyperimmune serum-reactive antigen or a fragment
thereof according to the present invention with a candidate antagonist under conditions to
permit binding of said candidate antagonist to said hyperimmune serum-reactive antigen or
fragment, in the presence of a component capable of providing a detectable signal in response to
the binding of the candidate antagonist to said hyperimmune serum reactive antigen or fragment
thereof; and

b) detecting the presence or absence of a signal generated in response to the binding of the
antagonist to the hyperimmune serum reactive antigen or the fragment thereof.

An antagonist capable of reducing or inhibiting the interaction activity of a hyperimmune serum-reactive
antigen or a fragment thereof according to the present invention to its interaction partner may be
identified by a method comprising the following steps:

a) providing a hyperimmune serum reactive antigen or a hyperimmune fragment thereof according
to the present invention,

b) providing an interaction partner to said hyperimmune serum reactive antigen or a fragment
thereof, especially an antibody according to the present invention,

c) allowing interaction of said hyperimmune serum reactive antigen or fragment thereof to said
interaction partner to form an interaction complex,

d) providing a candidate antagonist,

e) allowing a competition reaction to occur between the candidate antagonist and the interaction
complex,

f) determining whether the candidate antagonist inhibits or reduces the interaction activities of the
hyperimmune serum reactive antigen or the fragment thereof with the interaction partner.

The hyperimmune serum reactive antigens or fragments thereof according to the present invention may
be used for the isolation and/or purification and/or identification of an interaction partner of said
hyperimmune serum reactive antigen or fragment thereof.

The present invention also provides a process for in vitro diagnosing a disease related to expression of a
hyperimmune serum-reactive antigen or a fragment thereof according to the present invention
comprising determining the presence of a nucleic acid sequence encoding said hyperimmune serum
reactive antigen or fragment thereof according to the present invention or the presence of the
hyperimmune serum reactive antigen or fragment thereof according to the present invention.

The present invention also provides a process for in vitro diagnosis of a bacterial infection, especially a S.
pneumoniae infection, comprising analyzing for the presence of a nucleic acid sequence encoding said
hyperimmune serum reactive antigen or fragment thereof according to the present invention or the
presence of the hyperimmune serum reactive antigen or fragment thereof according to the present
invention.

Moreover, the present invention provides the use of a hyperimmune serum reactive antigen or fragment 
thereof according to the present invention for the generation of a peptide binding to said hyperimmune 
serum reactive antigen or fragment thereof, wherein the peptide is an anticaline. 

The present invention also provides the use of a hyperimmune serum-reactive antigen or fragment
thereof according to the present invention for the manufacture of a functional nucleic acid, wherein the
functional nucleic acid is selected from the group comprising aptamers and spiegelmers.

The nucleic acid molecule according to the present invention may also be used for the manufacture of a
functional ribonucleic acid, wherein the functional ribonucleic acid is selected from the group comprising
ribozymes, antisense nucleic acids and siRNA.

The present invention advantageously provides an efficient, relevant and comprehensive set of isolated
nucleic acid molecules and their encoded hyperimmune serum reactive antigens or fragments thereof
identified from S. pneumoniae using an antibody preparation from multiple human plasma pools and
surface expression libraries derived from the genome of S. pneumoniae. Thus, the present invention fulfils
a widely felt demand for S. pneumoniae antigens, vaccines, diagnostics and products useful in procedures
for preparing antibodies and for identifying compounds effective against S. pneumoniae infection.

An effective vaccine should be composed of proteins or polypeptides, which are expressed by all strains
and are able to induce high affinity, abundant antibodies against cell surface components of S.
pneumoniae. The antibodies should be IgG1 and/or IgG3 for opsonization, and any IgG subtype and IgA
for neutralisation of adherence and toxin action. A chemically defined vaccine must be definitely superior
compared to a whole cell vaccine (attenuated or killed), since components of S. pneumoniae, which cross-
react with human tissues or inhibit opsonization can be eliminated, and the individual proteins inducing 
protective antibodies and/or a protective immune response can be selected. 

The approach, which has been employed for the present invention, is based on the interaction of
pneumococcal proteins or peptides with the antibodies present in human sera. The antibodies produced
against S. pneumoniae by the human immune system and present in human sera are indicative of the in
vivo expression of the antigenic proteins and their immunogenicity. In addition, the antigenic proteins as
identified by the bacterial surface display expression libraries using pools of pre-selected sera, are
processed in a second and third round of screening by individual selected or generated sera. Thus the
present invention supplies an efficient, relevant, comprehensive set of pneumococcal antigens as a
pharmaceutical composition, especially a vaccine preventing infection by S. pneumoniae.

In the antigen identification program for identifying a comprehensive set of antigens according to the 
present invention, at least two different bacterial surface expression libraries are screened with several 
serum pools or plasma fractions or other pooled antibody containing body fluids (antibody pools). The
antibody pools are derived from a serum collection, which has been tested against antigenic compounds
of S. pneumoniae, such as whole cell extracts and culture supernatant proteins. Preferably, two distinct
serum collections are used: 1. With very stable antibody repertoire: normal adults, clinically healthy
people, who are non-carriers and overcame previous encounters or currently carriers of S. pneumoniae
without acute disease and symptoms, 2. With antibodies induced acutely by the presence of the
pathogenic organism: patients with acute disease with different manifestations (e.g. S. pneumoniae
pharyngitis, pneumonia, bacteraemia, peritonitis, meningitis and sepsis). Sera have to react with multiple
Pneumococcus-specific antigens in order to be considered hyperimmune and therefore relevant in the
screening method applied for the present invention. 

The expression libraries as used in the present invention should allow expression of all potential antigens,
e.g. derived from all secreted and surface proteins of S. pneumoniae. Bacterial surface display libraries will
be represented by a recombinant library of a bacterial host displaying a (total) set of expressed peptide
sequences of S. pneumoniae on two selected outer membrane proteins (LamB and FhuA) at the bacterial
host membrane {Georgiou, G., 1997}; {Etz, H. et al., 2001}. One of the advantages of using recombinant
expression libraries is that the identified hyperimmune serum-reactive antigens may be instantly
produced by expression of the coding sequences of the screened and selected clones expressing the
hyperimmune serum-reactive antigens without further recombinant DNA technology or cloning steps
necessary. 

The comprehensive set of antigens identified by the described program according to the present 
invention is analysed further by one or more additional rounds of screening. Therefore individual 
antibody preparations or antibodies generated against selected peptides, which were identified as 
immunogenic are used. According to a preferred embodiment the individual antibody preparations for 
the second round of screening are derived from patients who have suffered from an acute infection with
S. pneumoniae, especially from patients who show an antibody titer above a certain minimum level, for
example an antibody titer higher than the 80th percentile, preferably higher than the 90th percentile, especially
higher than the 95th percentile of the human (patient or healthy individual) sera tested. Using such high titer
individual antibody preparations in the second screening round allows a very selective identification of
the hyperimmune serum-reactive antigens and fragments thereof from S. pneumoniae.
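
By way of illustration only, the percentile-based selection of individual sera described above may be sketched as follows. The function names, the serum identifiers and the titer values are hypothetical and are not taken from the present disclosure, and the percentile rank is computed here as the share of tested sera with a strictly lower titer, which is one of several possible conventions.

```python
def percentile_rank(value, population):
    """Percentage of titers in the population that lie strictly below value."""
    below = sum(1 for titer in population if titer < value)
    return 100.0 * below / len(population)


def select_high_titer(titers, cutoff_percentile=80.0):
    """Return the IDs of sera whose titer ranks above the cutoff percentile.

    titers maps a (hypothetical) serum ID to its measured antibody titer.
    """
    values = list(titers.values())
    return sorted(
        serum_id
        for serum_id, titer in titers.items()
        if percentile_rank(titer, values) > cutoff_percentile
    )


# Hypothetical example: 20 sera with titers of 1,000 to 20,000 units.
example_titers = {f"S{i:02d}": 1000 * i for i in range(1, 21)}
above_80th = select_high_titer(example_titers, 80.0)
above_90th = select_high_titer(example_titers, 90.0)
```

Raising the cutoff from the 80th to the 90th or 95th percentile narrows the panel of sera, which is the trade-off between selectivity and panel size discussed in the text.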

Following the comprehensive screening procedure, the selected antigenic proteins, expressed as
recombinant proteins or in vitro translated products, in case they cannot be expressed in prokaryotic
expression systems, or the identified antigenic peptides (produced synthetically) are tested in a second
screening by a series of ELISA and Western blotting assays for the assessment of their immunogenicity
with a large human serum collection (minimum ~150 healthy and patient sera).

It is important that the individual antibody preparations (which may also be the selected serum) allow a 
selective identification of the most promising candidates of all the hyperimmune serum-reactive antigens 
from all the promising candidates from the first round. Therefore, preferably at least 10 individual
antibody preparations (i.e. antibody preparations (e.g. sera) from at least 10 different individuals having
suffered from an infection to the chosen pathogen) should be used in identifying these antigens in the
second screening round. Of course, it is possible to use also less than 10 individual preparations;
however, selectivity of the step may not be optimal with a low number of individual antibody
preparations. On the other hand, if a given hyperimmune serum-reactive antigen (or an antigenic
fragment thereof) is recognized by at least 10 individual antibody preparations, preferably at least 30,
especially at least 50 individual antibody preparations, identification of the hyperimmune serum-reactive
antigen is also selective enough for a proper identification. Hyperimmune serum-reactivity may of course
be tested with as many individual preparations as possible (e.g. with more than 100 or even with more
than 1,000).

Therefore, the relevant portion of the hyperimmune serum-reactive antibody preparations according to 
the method of the present invention should preferably be at least 10, more preferred at least 30, especially 
at least 50 individual antibody preparations. Alternatively (or in combination) hyperimmune serum-
reactive antigens may preferably be also identified with at least 20%, preferably at least 30%, especially at
least 40% of all individual antibody preparations used in the second screening round. 
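
The two acceptance criteria stated above (an absolute number of recognizing preparations and a fraction of all preparations used) may be illustrated by the following minimal sketch. The function name and the example reactivity data are hypothetical; how a single preparation is scored as reactive is not addressed here and is simply assumed to be given as a Boolean value.

```python
def passes_second_round(reactive, min_count=10, min_fraction=0.20):
    """Evaluate one antigen against both selection criteria.

    reactive: one Boolean per individual antibody preparation, True if that
    preparation recognized the antigen (the scoring itself is assumed).
    Returns (meets_count_criterion, meets_fraction_criterion).
    """
    n_reactive = sum(1 for r in reactive if r)
    by_count = n_reactive >= min_count
    by_fraction = bool(reactive) and n_reactive / len(reactive) >= min_fraction
    return by_count, by_fraction


# Hypothetical antigen recognized by 12 of 50 individual preparations.
example = [True] * 12 + [False] * 38
default_result = passes_second_round(example)
strict_result = passes_second_round(example, min_count=30, min_fraction=0.30)
```

The criteria are returned separately because the text allows them to be applied alternatively or in combination.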

According to a preferred embodiment of the present invention, the sera from which the individual 
antibody preparations for the second round of screening are prepared (or which are used as antibody
preparations), are selected by their titer against S. pneumoniae (e.g. against a preparation of this pathogen,
such as a lysate, cell wall components and recombinant proteins). Preferably, some are selected with a
total IgA titer above 2,000 U, especially above 4,000 U, and/or an IgG titer above 5,000 U, especially above
12,000 U (U = units, calculated from the OD405nm reading at a given dilution) when the whole organism
(total lysate or whole cells) is used as antigen in the ELISA.

The antibodies produced against streptococci by the human immune system and present in human sera 
are indicative of the in vivo expression of the antigenic proteins and their immunogenicity. The
recognition of linear epitopes recognized by serum antibodies can be based on sequences as short as 4-5
amino acids. Of course it does not necessarily mean that these short peptides are capable of inducing the
given antibody in vivo. For that reason the defined epitopes, polypeptides and proteins are further to be
tested in animals (mainly in mice) for their capacity to induce antibodies against the selected proteins in
vivo. 

The preferred antigens are located on the cell surface or are secreted, and are therefore accessible 
extracellularly. Antibodies against cell wall proteins are expected to serve multiple purposes: to inhibit 
adhesion, to interfere with nutrient acquisition, to inhibit immune evasion and to promote phagocytosis
{Hornef, M. et al., 2002}. Antibodies against secreted proteins are beneficial in neutralisation of their
function as toxin or virulence component. It is also known that bacteria communicate with each other
through secreted proteins. Neutralizing antibodies against these proteins will interrupt growth-
promoting cross-talk between or within streptococcal species. Bioinformatic analyses (signal sequences,
cell wall localisation signals, transmembrane domains) proved to be very useful in assessing cell surface
localisation or secretion. The experimental approach includes the isolation of antibodies with the
corresponding epitopes and proteins from human serum, and the generation of immune sera in mice
against (poly)peptides selected by the bacterial surface display screens. These sera are then used in a
third round of screening as reagents in the following assays: cell surface staining of S. pneumoniae grown
under different conditions (FACS or microscopy), determination of neutralizing capacity (toxin,
adherence), and promotion of opsonization and phagocytosis (in vitro phagocytosis assay).

For that purpose, bacterial E. coli clones are directly injected into mice and immune sera are taken and
tested in the relevant in vitro assay for functional opsonic or neutralizing antibodies. Alternatively,
specific antibodies may be purified from human or mouse sera using peptides or proteins as substrate.

Host defence against S. pneumoniae relies mainly on the opsonophagocytic killing mechanism. Inducing high
affinity antibodies of the opsonic and neutralizing type by vaccination helps the innate immune system to
eliminate bacteria and toxins. This makes the method according to the present invention an optimal tool
for the identification of pneumococcal antigenic proteins.

The skin and mucous membranes are formidable barriers against invasion by streptococci. However,
once the skin or the mucous membranes are breached the first line of non-adaptive cellular defence 
begins its co-ordinate action through complement and phagocytes, especially the polymorphonuclear 
leukocytes (PMNs). These cells can be regarded as the cornerstones in eliminating invading bacteria. As 
Streptococcus pneumoniae is a primarily extracellular pathogen, the major anti-streptococcal adaptive 
response comes from the humoral arm of the immune system, and is mediated through three major 
mechanisms: promotion of opsonization, toxin neutralisation, and inhibition of adherence. It is believed 
that opsonization is especially important, because of its requirement for an effective phagocytosis. For 
efficient opsonization the microbial surface has to be coated with antibodies and complement factors for 
recognition by PMNs through receptors to the Fc fragment of the IgG molecule or to activated C3b. After 
opsonization, streptococci are phagocytosed and killed. Antibodies bound to specific antigens on the cell 
surface of bacteria serve as ligands for the attachment to PMNs and to promote phagocytosis. The very 
same antibodies bound to the adhesins and other cell surface proteins are expected to neutralize adhesion 
and prevent colonization. The selection of antigens as provided by the present invention is thus well 
suited to identify those that will lead to protection against infection in an animal model or in humans. 

According to the antigen identification method used herein, the present invention can surprisingly 
provide a set of comprehensive novel nucleic acids and novel hyperimmune serum reactive antigens and
fragments thereof of S. pneumoniae, among other things, as described below. According to one aspect, the
invention particularly relates to the nucleotide sequences encoding hyperimmune serum reactive
antigens which sequences are set forth in the Sequence Listing Seq ID No: 1-144, 289-303 and the
corresponding encoded amino acid sequences representing hyperimmune serum reactive antigens are set
forth in the Sequence Listing Seq ID No 145-288 and 304-318.

In a preferred embodiment of the present invention, a nucleic acid molecule is provided which exhibits
70% identity over its entire length to a nucleotide sequence set forth with Seq ID No 1, 101-144. Most
highly preferred are nucleic acids that comprise a region that is at least 80% or at least 85% identical over
their entire length to a nucleic acid molecule set forth with Seq ID No 1, 101-144. In this regard, nucleic
acid molecules at least 90%, 91%, 92%, 93%, 94%, 95%, or 96% identical over their entire length to the
same are particularly preferred. Furthermore, those with at least 97% are highly preferred, those with at
least 98% and at least 99% are particularly highly preferred, with at least 99% or 99.5% being the more
preferred, with 100% identity being especially preferred. Moreover, preferred embodiments in this
respect are nucleic acids which encode hyperimmune serum reactive antigens or fragments thereof
(polypeptides) which retain substantially the same biological function or activity as the mature
polypeptide encoded by said nucleic acids set forth in the Seq ID No 1, 101-144.

Identity, as known in the art and used herein, is the relationship between two or more polypeptide
sequences or two or more polynucleotide sequences, as determined by comparing the sequences. In the
art, identity also means the degree of sequence relatedness between polypeptide or polynucleotide
sequences, as the case may be, as determined by the match between strings of such sequences. Identity
can be readily calculated. While there exist a number of methods to measure identity between two
polynucleotide or two polypeptide sequences, the term is well known to skilled artisans (e.g. Sequence
Analysis in Molecular Biology, von Heijne, G., Academic Press, 1987). Preferred methods to determine
identity are designed to give the largest match between the sequences tested. Methods to determine
identity are codified in computer programs. Preferred computer program methods to determine identity
between two sequences include, but are not limited to, GCG program package {Devereux, J. et al., 1984},
BLASTP, BLASTN, and FASTA {Altschul, S. et al., 1990}.
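
For illustration, a percent identity over the entire length may be computed from two sequences that have already been aligned (for example by one of the programs named above). The following is a minimal sketch under stated assumptions and not the algorithm of any of those programs: the function name is hypothetical, the gap character is assumed to be "-", and identity is taken as matching positions divided by the full alignment length, whereas individual programs may use a different denominator.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: identity = identical non-gap positions / alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical 10-base sequences differing at one position.
identity_substitution = percent_identity("ACGTACGTAC", "ACGTACGTAA")
# Hypothetical alignment with a single gap in the first sequence.
identity_gap = percent_identity("ACGT-CGT", "ACGTACGT")
```

A threshold such as "at least 96% identical" can then be checked by comparing the returned value with 96.0.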

According to another aspect of the invention, nucleic acid molecules are provided which exhibit at least
96% identity to the nucleic acid sequence set forth with Seq ID No 2-6, 8, 10-16, 18-23, 25-31, 34, 36, 38-42,
44, 47-48, 51, 53, 55-6^ 64, 67, 71-76, 78-79, 81-94, 96-100.

According to a further aspect of the present invention, nucleic acid molecules are provided which are
identical to the nucleic acid sequences set forth with Seq ID No 9, 17, 24, 32, 37, 43, 52, 54, 65-66, 70, 80.

The nucleic acid molecules according to the present invention can as a second alternative also be a nucleic
acid molecule which is at least essentially complementary to the nucleic acid described as the first
alternative above. As used herein complementary means that a nucleic acid strand is base pairing via
Watson-Crick base pairing with a second nucleic acid strand. Essentially complementary as used herein
means that the base pairing is not occurring for all of the bases of the respective strands but leaves a
certain number or percentage of the bases unpaired or wrongly paired. The percentage of correctly
pairing bases is preferably at least 70 %, more preferably 80 %, even more preferably 90 % and most
preferably any percentage higher than 90 %. It is to be noted that a percentage of 70 % matching bases is
considered as homology and the hybridization having this extent of matching base pairs is considered as
stringent. Hybridization conditions for this kind of stringent hybridization may be taken from Current
Protocols in Molecular Biology (John Wiley and Sons, Inc., 1987). More particularly, the hybridization 
conditions can be as follows: 

• Hybridization performed e.g. in 5 x SSPE, 5 x Denhardt's reagent, 0.1% SDS, 100 µg/mL sheared
DNA at 68°C

• Moderate stringency wash in 0.2 x SSC, 0.1% SDS at 42°C

• High stringency wash in 0.1 x SSC, 0.1% SDS at 68°C
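
The percentage of correctly pairing bases referred to in the definition of essentially complementary strands may be illustrated as follows. This is a minimal sketch under stated assumptions: both strands are DNA of equal length written 5' to 3', pairing is evaluated position by position without gaps or bulges, and the function names and example sequences are hypothetical.

```python
# Watson-Crick partners for DNA bases.
WATSON_CRICK = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_paired(strand, other):
    """Percentage of positions forming Watson-Crick pairs.

    Both strands are given 5'->3'; they pair antiparallel, so the second
    strand is read in reverse.
    """
    if len(strand) != len(other) or not strand:
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(
        1
        for a, b in zip(strand.upper(), reversed(other.upper()))
        if WATSON_CRICK.get(a) == b
    )
    return 100.0 * paired / len(strand)


def is_essentially_complementary(strand, other, min_percent=70.0):
    """Apply the lowest preferred level (70 %) given in the text by default."""
    return percent_paired(strand, other) >= min_percent


fully_paired = percent_paired("AAGGCCTTAC", "GTAAGGCCTT")
one_mismatch = percent_paired("AAGGCCTTAC", "GTAAGGCCTA")
```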

Genomic DNA with a GC content of 50% has an approximate Tm of 96°C. For 1% mismatch, the Tm is
reduced by approximately 1°C.
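
The rule of thumb just stated can be written as a simple linear estimate. The sketch below merely restates the two approximate figures given above (a Tm of about 96°C at 50% GC content and a reduction of about 1°C per 1% mismatch); the function and parameter names are hypothetical, and the linear relationship is a rough approximation that should not be relied upon for high mismatch levels.

```python
def estimated_tm(mismatch_percent, base_tm_c=96.0, drop_per_percent_c=1.0):
    """Approximate melting temperature in degrees C for a given mismatch level.

    Defaults restate the approximate figures from the text; the result is
    an estimate only.
    """
    if not 0.0 <= mismatch_percent <= 100.0:
        raise ValueError("mismatch_percent must be between 0 and 100")
    return base_tm_c - drop_per_percent_c * mismatch_percent


tm_perfect = estimated_tm(0)    # perfectly matched duplex
tm_10_percent = estimated_tm(10)  # duplex with 10% mismatched bases
```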

In addition, any of the further hybridization conditions described herein are in principle applicable as
well. 

Of course, all nucleic acid sequence molecules which encode the same polypeptide molecule as those
identified by the present invention are encompassed by any disclosure of a given coding sequence, since
the degeneracy of the genetic code is directly applicable to unambiguously determine all possible nucleic
acid molecules which encode a given polypeptide molecule, even if the number of such degenerated
nucleic acid molecules may be high. This is also applicable for fragments of a given polypeptide, as long
as the fragments encode a polypeptide being suitable to be used in a vaccination connection, e.g. as an
active or passive vaccine. 
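
How the degeneracy of the genetic code determines all nucleic acid molecules encoding a given polypeptide, and why their number may be high, can be illustrated by the following minimal sketch. It uses the standard genetic code (whose sense codon assignments are also those used for bacteria); the function names and the example peptide are hypothetical and serve illustration only.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Reverse lookup: amino acid -> list of synonymous codons.
CODONS_FOR = {}
for codon, amino_acid in CODON_TABLE.items():
    CODONS_FOR.setdefault(amino_acid, []).append(codon)


def count_encodings(peptide):
    """Number of distinct DNA sequences encoding the peptide."""
    total = 1
    for amino_acid in peptide.upper():
        total *= len(CODONS_FOR[amino_acid])
    return total


def enumerate_encodings(peptide):
    """Yield every DNA sequence encoding the peptide (only for short peptides)."""
    choices = [CODONS_FOR[amino_acid] for amino_acid in peptide.upper()]
    for combination in product(*choices):
        yield "".join(combination)


def translate(dna):
    """Translate an in-frame DNA sequence codon by codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical tripeptide Met-Lys-Leu: 1 x 2 x 6 possible coding sequences.
mkl_encodings = list(enumerate_encodings("MKL"))
```

Because the count is the product of the number of synonymous codons at each position, it grows exponentially with peptide length, so counting is practical for full-length proteins whereas enumeration is only practical for short fragments.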

The nucleic acid molecule according to the present invention can as a third alternative also be a nucleic
acid which comprises a stretch of at least 15 bases of the nucleic acid molecule according to the first and
second alternative of the nucleic acid molecules according to the present invention as outlined above.
Preferably, the bases form a contiguous stretch of bases. However, it is also within the scope of the 
present invention that the stretch consists of two or more moieties, which are separated by a number of 
bases. 
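
Whether a candidate molecule comprises a contiguous stretch of at least 15 bases of a given nucleic acid molecule may be checked, for example, by determining the longest stretch the two sequences share. The following is a minimal sketch with hypothetical function names and made-up example sequences; it compares the given strands only (the reverse complement is not considered) and does not handle stretches made up of separated moieties.

```python
def longest_shared_stretch(query, reference):
    """Length of the longest contiguous run of bases present in both sequences."""
    q, r = query.upper(), reference.upper()
    best = 0
    # previous[j] = length of the shared run ending at q[i-2] and r[j-1].
    previous = [0] * (len(r) + 1)
    for i in range(1, len(q) + 1):
        current = [0] * (len(r) + 1)
        for j in range(1, len(r) + 1):
            if q[i - 1] == r[j - 1]:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def comprises_stretch(query, reference, min_len=15):
    """True if query and reference share at least min_len contiguous bases."""
    return longest_shared_stretch(query, reference) >= min_len


# Made-up reference and a query embedding 17 of its bases between flanks.
reference_seq = "ATGGCTAGCTAGGATCCGATCGATTACGGCTA"
query_seq = "TTTT" + reference_seq[5:22] + "CCCC"
shared = longest_shared_stretch(query_seq, reference_seq)
```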

The present nucleic acids may preferably consist of at least 20, even more preferred at least 30, especially
at least 50 contiguous bases from the sequences disclosed herein. The suitable length may easily be
optimized due to the planned area of use (e.g. as (PCR) primers, probes, capture molecules (e.g. on
a (DNA) chip), etc.). Preferred nucleic acid molecules contain at least a contiguous 15 base portion of one
or more of the predicted immunogenic amino acid sequences listed in tables 1 and 2, especially the
sequences of table 2 with scores of more than 10, preferably more than 20, especially with a score of more
than 25. Specifically preferred are nucleic acids containing a contiguous portion of a DNA sequence of
any sequence in the sequence protocol of the present application which shows 1 or more, preferably more
than 2, especially more than 5, non-identical nucleic acid residues compared to the published
Streptococcus pneumoniae strain TIGR4 genome ({Tettelin, H. et al., 2001}; GenBank accession AE005672)
and/or any other published S. pneumoniae genome sequence or parts thereof, especially of the strain R6
({Hoskins, J. et al., 2001}; GenBank accession AE007317). Specifically preferred non-identical nucleic acid
residues are residues, which lead to a non-identical amino acid residue. Preferably, the nucleic acid
sequences encode for polypeptides having at least 1, preferably at least 2, preferably at least 3
different amino acid residues compared to the published S. pneumoniae counterparts mentioned above.
Also such isolated polypeptides, being fragments of the proteins (or the whole protein) mentioned herein
e.g. in the sequence listing, having at least 6, 7, or 8 amino acid residues and being encoded by these
nucleic acids are preferred.
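
The distinction drawn above between non-identical nucleic acid residues in general and those leading to a non-identical amino acid residue may be illustrated by the following minimal sketch. It assumes two in-frame coding sequences of equal length without insertions or deletions and uses the standard genetic code; the function names and the example sequences are hypothetical and are not taken from the sequence listing or from the published genomes.

```python
# Standard genetic code in the conventional T, C, A, G ordering ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA sequence whose length is a multiple of 3."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def compare_to_reference(candidate, reference):
    """Return (nucleotide differences, amino acid differences).

    Assumption: both sequences are in frame, equally long and free of indels.
    """
    candidate, reference = candidate.upper(), reference.upper()
    if len(candidate) != len(reference) or len(candidate) % 3 != 0:
        raise ValueError("sequences must be equally long, in-frame coding sequences")
    nt_diffs = sum(1 for a, b in zip(candidate, reference) if a != b)
    aa_diffs = sum(
        1 for a, b in zip(translate(candidate), translate(reference)) if a != b
    )
    return nt_diffs, aa_diffs


# Made-up example: AAA->AAG is silent (Lys), GCT->GAT changes Ala to Asp.
differences = compare_to_reference("ATGAAGCTGGAT", "ATGAAACTGGCT")
```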

The nucleic acid molecule according to the present invention can as a fourth alternative also be a nucleic
acid molecule which anneals under stringent hybridisation conditions to any of the nucleic acids of the
present invention according to the above outlined first, second, and third alternative. Stringent 
hybridisation conditions are typically those described herein. 

Finally, the nucleic acid molecule according to the present invention can as a fifth alternative also be a
nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to any of the
nucleic acid molecules according to any nucleic acid molecule of the present invention according to the
first, second, third, and fourth alternative as outlined above. This kind of nucleic acid molecule refers to
the fact that preferably the nucleic acids according to the present invention code for the hyperimmune
serum reactive antigens or fragments thereof according to the present invention. This kind of nucleic acid
molecule is particularly useful in the detection of a nucleic acid molecule according to the present
invention and thus the diagnosis of the respective microorganisms such as S. pneumoniae and any disease
or diseased condition where this kind of microorganism is involved. Preferably, the hybridisation would
occur or be performed under stringent conditions as described in connection with the fourth alternative
described above. 

Nucleic acid molecule as used herein generally refers to any ribonucleic acid molecule or
deoxyribonucleic acid molecule, which may be unmodified RNA or DNA or modified RNA or DNA.
Thus, for instance, nucleic acid molecule as used herein refers to, among others, single- and double-
stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded
RNA, and RNA that is a mixture of
single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-
stranded or, more typically, double-stranded, or triple-stranded, or a mixture of single- and double-
stranded regions. In addition, nucleic acid molecule as used herein refers to triple-stranded regions
comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from the same
molecule or from different molecules. The regions may include all of one or more of the molecules, but
more typically involve only a region of some of the molecules. One of the molecules of a triple-helical
region often is an oligonucleotide. As used herein, the term nucleic acid molecule includes DNAs or
RNAs as described above that contain one or more modified bases. Thus, DNAs or RNAs with backbones
modified for stability or for other reasons are "nucleic acid molecule" as that term is intended herein.
Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as
tritylated bases, to name just two examples, are nucleic acid molecule as the term is used herein. It will be
appreciated that a great variety of modifications have been made to DNA and RNA that serve many
useful purposes known to those of skill in the art. The term nucleic acid molecule as it is employed herein
embraces such chemically, enzymatically or metabolically modified forms of nucleic acid molecule, as
well as the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and
complex cells, inter alia. The term nucleic acid molecule also embraces short nucleic acid molecules often
referred to as oligonucleotide(s). "Polynucleotide" and "nucleic acid" or "nucleic acid molecule" are often
used interchangeably herein.



The nucleic acid molecules provided in the present invention also encompass numerous unique fragments,
both longer and shorter than the nucleic acid molecule sequences set forth in the sequence listing of the
S. pneumoniae coding regions, which can be generated by standard cloning methods. To be unique, a
fragment must be of sufficient size to distinguish it from other known nucleic acid sequences, most
readily determined by comparing any selected S. pneumoniae fragment to the nucleotide sequences in
computer databases such as GenBank.

Additionally, modifications can be made to the nucleic acid molecules and polypeptides that are
encompassed by the present invention. For example, nucleotide substitutions can be made which do not
affect the polypeptide encoded by the nucleic acid, and thus any nucleic acid molecule which encodes a
hyperimmune serum reactive antigen or fragments thereof is encompassed by the present invention.

Furthermore, any of the nucleic acid molecules encoding hyperimmune serum reactive antigens or
fragments thereof provided by the present invention can be functionally linked, using standard
techniques such as standard cloning techniques, to any desired regulatory sequences, whether a S.
pneumoniae regulatory sequence or a heterologous regulatory sequence, heterologous leader sequence,
heterologous marker sequence or a heterologous coding sequence to create a fusion protein.

Nucleic acid molecules of the present invention may be in the form of RNA, such as mRNA or cRNA, or
in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or produced
by chemical synthetic techniques or by a combination thereof. The DNA may be triple-stranded, double-
stranded or single-stranded. Single-stranded DNA may be the coding strand, also known as the sense
strand, or it may be the non-coding strand, also referred to as the anti-sense strand.

The present invention further relates to variants of the herein above described nucleic acid molecules
which encode fragments, analogs and derivatives of the hyperimmune serum reactive antigens and
fragments thereof having a deduced S. pneumoniae amino acid sequence set forth in the Sequence Listing.
A variant of the nucleic acid molecule may be a naturally occurring variant such as a naturally occurring
allelic variant, or it may be a variant that is not known to occur naturally. Such non-naturally occurring
variants of the nucleic acid molecule may be made by mutagenesis techniques, including those applied to
nucleic acid molecules, cells or organisms. 

Among variants in this regard are variants that differ from the aforementioned nucleic acid molecules by
nucleotide substitutions, deletions or additions. The substitutions, deletions or additions may involve one
or more nucleotides. The variants may be altered in coding or non-coding regions or both. Alterations in
the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or
additions. Preferred are nucleic acid molecules encoding a variant, analog, derivative or fragment, or a
variant, analogue or derivative of a fragment, which have a S. pneumoniae sequence as set forth in the
Sequence Listing, in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid(s) is substituted,
deleted or added, in any combination. Especially preferred among these are silent substitutions, additions
and deletions, which do not alter the properties and activities of the S. pneumoniae polypeptides set forth
in the Sequence Listing. Also especially preferred in this regard are conservative substitutions. 

The peptides and fragments according to the present invention also include modified epitopes wherein 
preferably one or two of the amino acids of a given epitope are modified or replaced according to the
rules disclosed in e.g. {Tourdot, S. et al., 2000}, as well as the nucleic acid sequences encoding such
modified epitopes. 

It is clear that also epitopes derived from the present epitopes by amino acid exchanges improving,
conserving or at least not significantly impeding the T cell activating capability of the epitopes are
covered by the epitopes according to the present invention. Therefore the present epitopes also cover
epitopes, which do not contain the original sequence as derived from S. pneumoniae, but trigger the same
or preferably an improved T cell response. These epitopes are referred to as "heteroclitic"; they need to
have a similar or preferably greater affinity to MHC/HLA molecules, and they need the ability to stimulate
the T cell receptors (TCR) directed to the original epitope in a similar or preferably stronger manner.

Heteroclitic epitopes can be obtained by rational design, i.e. taking into account the contribution of
individual residues to binding to MHC/HLA as for instance described by {Rammensee, H. et al., 1999}, 
combined with a systematic exchange of residues potentially interacting with the TCR and testing the 
resulting sequences with T cells directed against the original epitope. Such a design is possible for a 
skilled man in the art without much experimentation. 

Another possibility includes the screening of peptide libraries with T cells directed against the original 
epitope. A preferred way is the positional scanning of synthetic peptide libraries. Such approaches have 
been described in detail for instance by {Hemmer, B. et al., 1999} and the references given therein.

As an alternative to epitopes represented by the present derived amino acid sequences or heteroclitic
epitopes, also substances mimicking these epitopes e.g. "peptidemimetica" or "retro-inverso-peptides" can
be applied. 

Another aspect of the design of improved epitopes is their formulation or modification with substances 
increasing their capacity to stimulate T cells. These include T helper cell epitopes, lipids or liposomes or
preferred modifications as described in WO 01/78767. 

Another way to increase the T cell stimulating capacity of epitopes is their formulation with immune
stimulating substances for instance cytokines or chemokines like interleukin-2, -7, -12, -18, class I and II
interferons (IFN), especially IFN-gamma, GM-CSF, TNF-alpha, flt3-ligand and others.

As discussed additionally herein regarding nucleic acid molecule assays of the invention, for instance,
nucleic acid molecules of the invention as discussed above, may be used as a hybridization probe for
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding polypeptides
of the present invention and to isolate cDNA and genomic clones of other genes that have a high
sequence similarity to the nucleic acid molecules of the present invention. Such probes generally will
comprise at least 15 bases. Preferably, such probes will have at least 20, at least 25 or at least 30 bases, and 
may have at least 50 bases. Particularly preferred probes will have at least 30 bases, and will have 50 
bases or less, such as 30, 35, 40, 45, or 50 bases. 

For example, the coding region of a nucleic acid molecule of the present invention may be isolated by
screening a relevant library using the known DNA sequence to synthesize an oligonucleotide probe. A
labeled oligonucleotide having a sequence complementary to that of a gene of the present invention is
then used to screen a library of cDNA, genomic DNA or mRNA to determine to which members of the 
library the probe hybridizes. 
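
The derivation of oligonucleotide probes complementary to a known coding sequence may be sketched as follows. This is an illustrative assumption only: the function names, the tiling step and the example sequence are hypothetical, the default length of 30 bases reflects the particularly preferred range of 30 to 50 bases mentioned above, and no account is taken of melting temperature, secondary structure or cross-hybridization, which a practical probe design would also consider.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (written 5'->3')."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(gene, length=30, step=10):
    """Tile the gene and return probes complementary to each window.

    Probes shorter than 15 bases are rejected, following the minimum
    length given in the text.
    """
    if length < 15:
        raise ValueError("probes generally comprise at least 15 bases")
    gene = gene.upper()
    return [
        reverse_complement(gene[start:start + length])
        for start in range(0, len(gene) - length + 1, step)
    ]


# Made-up 60-base sequence used only to demonstrate the tiling.
example_gene = "ATGC" * 15
probes = candidate_probes(example_gene, length=30, step=10)
```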

The nucleic acid molecules and polypeptides of the present invention may be employed as reagents and
materials for development of treatments of and diagnostics for disease, particularly human disease, as
further discussed herein relating to nucleic acid molecule assays, inter alia.

The nucleic acid molecules of the present invention that are oligonucleotides can be used in the processes
herein as described, but preferably for PCR, to determine whether or not the S. pneumoniae genes
identified herein in whole or in part are present and/or transcribed in infected tissue such as blood. It is
recognized that such sequences will also have utility in diagnosis of the stage of infection and type of
infection the pathogen has attained. For this and other purposes the arrays comprising at least one of the
nucleic acids according to the present invention as described herein, may be used.

The nucleic acid molecules according to the present invention may be used for the detection of nucleic
acid molecules and organisms or samples containing these nucleic acids. Preferably such detection is for
diagnosis, more preferably for the diagnosis of a disease related or linked to the presence or abundance of
S. pneumoniae.

Eukaryotes (herein also "individual(s)"), particularly mammals, and especially humans, infected with S. 
pneumoniae may be identifiable by detecting any of the nucleic acid molecules according to the present 
invention at the DNA level by a variety of techniques. Preferred nucleic acid molecule 
candidates for distinguishing S. pneumoniae from other organisms can be obtained. 

The invention provides a process for diagnosing disease arising from infection with S. pneumoniae, 
comprising determining from a sample isolated or derived from an individual an increased level of 
expression of a nucleic acid molecule having the sequence of a nucleic acid molecule set forth in the 
Sequence Listing. Expression of nucleic acid molecules can be measured using any one of the methods 
well known in the art for the quantitation of nucleic acid molecules, such as, for example, PCR, RT-PCR, 
RNase protection, Northern blotting, other hybridisation methods and the arrays described herein. 

"Isolated" as used herein means separated "by the hand of man" from its natural state; i.e., that, if it occurs 
in nature, it has been changed or removed from its original environment, or both. For example, a 
naturally occurring nucleic acid molecule or a polypeptide naturally present in a living organism in its 
natural state is not "isolated," but the same nucleic acid molecule or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed herein. As part of or 
following isolation, such nucleic acid molecules can be joined to other nucleic acid molecules, such as 
DNAs, for mutagenesis, to form fusion proteins, and for propagation or expression in a host, for instance. 
The isolated nucleic acid molecules, alone or joined to other nucleic acid molecules such as vectors, can be 
introduced into host cells, in culture or in whole organisms. Introduced into host cells in culture or in 
whole organisms, such DNAs still would be isolated, as the term is used herein, because they would not 
be in their naturally occurring form or environment. Similarly, the nucleic acid molecules and 
polypeptides may occur in a composition, such as media formulations, solutions for introduction of 
nucleic acid molecules or polypeptides, for example, into cells, compositions or solutions for chemical or 
enzymatic reactions, for instance, which are not naturally occurring compositions, and therein remain 
isolated nucleic acid molecules or polypeptides within the meaning of that term as it is employed herein. 

The nucleic acids according to the present invention may be chemically synthesized. Alternatively, the 
nucleic acids can be isolated from S. pneumoniae by methods known to the one skilled in the art. 

According to another aspect of the present invention, a comprehensive set of novel hyperimmune serum 
reactive antigens and fragments thereof are provided by using the herein described antigen identification 
method. In a preferred embodiment of the invention, a hyperimmune serum-reactive antigen comprising 
an amino acid sequence being encoded by any one of the nucleic acid molecules herein described and 
fragments thereof are provided. In another preferred embodiment of the invention a novel set of 
hyperimmune serum-reactive antigens which comprises amino acid sequences selected from a group 
consisting of the polypeptide sequences as represented in Seq ID No 145, 245-288 and fragments thereof 
are provided. In a further preferred embodiment of the invention hyperimmune serum-reactive antigens 
which comprise amino acid sequences selected from a group consisting of the polypeptide sequences as 
represented in Seq ID No 146-150, 152, 154-160, 162-167, 169-175, 178, 180, 182-186, 188, 191-192, 195, 197, 
199-206, 208, 211, 215-220, 222-223, 225-238, 240-244 and fragments thereof are provided. In a still 
preferred embodiment of the invention hyperimmune serum-reactive antigens which comprise amino 
acid sequences selected from a group consisting of the polypeptide sequences as represented in Seq ID 
No 153, 161, 168, 176, 181, 187, 196, 198, 209-210, 214, 224 and fragments thereof are provided. 

The hyperimmune serum reactive antigens and fragments thereof as provided in the invention include 
any polypeptide set forth in the Sequence Listing as well as polypeptides which have at least 70% identity 
to a polypeptide set forth in the Sequence Listing, preferably at least 80% or 85% identity to a polypeptide 
set forth in the Sequence Listing, and more preferably at least 90% similarity (more preferably at least 
90% identity) to a polypeptide set forth in the Sequence Listing and still more preferably at least 95%, 
96%, 97%, 98%, 99% or 99.5% similarity (still more preferably at least 95%, 96%, 97%, 98%, 99%, or 99.5% 
identity) to a polypeptide set forth in the Sequence Listing and also include portions of such polypeptides 
with such portion of the polypeptide generally containing at least 4 amino acids and more preferably at 
least 8, still more preferably at least 30, still more preferably at least 50 amino acids, such as 4, 8, 10, 20, 
30, 35, 40, 45 or 50 amino acids. 
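
The identity thresholds above can be made concrete with a short calculation. The following Python sketch computes percent identity between two sequences that have already been aligned (gaps written as "-"). The specification does not state which alignment program or identity formula is intended, so the definition used here (identical residues divided by alignment columns, ignoring columns that are gaps in both sequences) is an assumption, as are the function names and example sequences.

```python
# Illustrative sketch only: the identity definition is an assumption,
# since the text does not prescribe a formula or alignment tool.

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned positions")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 70.0) -> bool:
    """True if the pair reaches the given identity threshold (70% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical 8-residue peptides differing at one position.
    print(percent_identity("MKTAYIAK", "MKTAYLAK"))
    print(meets_identity_threshold("MKTAYIAK", "MKTAYLAK", threshold=90.0))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values for the same pair, so any threshold comparison should state the convention used.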

The invention also relates to fragments, analogs, and derivatives of these hyperimmune serum reactive 
antigens and fragments thereof. The terms "fragment", "derivative" and "analog", when referring to an 
antigen whose amino acid sequence is set forth in the Sequence Listing, mean a polypeptide which 
retains essentially the same or a similar biological function or activity as such hyperimmune serum 
reactive antigen and fragment thereof. 

The fragment, derivative or analog of a hyperimmune serum reactive antigen and fragment thereof may 
be 1) one in which one or more of the amino acid residues are substituted with a conserved or non- 
conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino 
acid residue may or may not be one encoded by the genetic code, or 2) one in which one or more of the 
amino acid residues includes a substituent group, or 3) one in which the mature hyperimmune serum 
reactive antigen or fragment thereof is fused with another compound, such as a compound to increase the 
half-life of the hyperimmune serum reactive antigen and fragment thereof (for example, polyethylene 
glycol), or 4) one in which the additional amino acids are fused to the mature hyperimmune serum 
reactive antigen or fragment thereof, such as a leader or secretory sequence or a sequence which is 
employed for purification of the mature hyperimmune serum reactive antigen or fragment thereof or a 
proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of 
those skilled in the art from the teachings herein. 

The present invention also relates to antigens of different S. pneumoniae isolates. Such homologues may 
easily be isolated based on the nucleic acid and amino acid sequences disclosed herein. There are more 
than 90 serotypes in more than 40 serogroups distinguished to date and the typing is based on serotype 
specific antisera. The presence of any antigen can accordingly be determined for every serotype. In 
addition it is possible to determine the variability of a particular antigen in the various serotypes as 
described for the S. pyogenes sic gene {Hoe, N. et al., 2001}. The contribution of the various serotypes to 
the different pneumococcal infections varies in the different age groups and geographical regions ({Gray, 
B. et al., 1979}; {Gray, B. et al., 1986}; {Orange, M. et al., 1993}; reviewed in Epidemiology and Prevention 
of Vaccine-Preventable Diseases, 7th Edition-Second Printing, The Pink Book). It is an important aspect 
that the most valuable protective antigens are expected to be conserved among various clinical strains. 

Among the particularly preferred embodiments of the invention in this regard are the hyperimmune 
serum reactive antigens set forth in the Sequence Listing, variants, analogs, derivatives and fragments 
thereof, and variants, analogs and derivatives of fragments. Additionally, fusion polypeptides 
comprising such hyperimmune serum reactive antigens, variants, analogs, derivatives and fragments 
thereof, and variants, analogs and derivatives of the fragments are also encompassed by the present 
invention. Such fusion polypeptides and proteins, as well as nucleic acid molecules encoding them, can 
readily be made using standard techniques, including standard recombinant techniques for producing 
and expression of a recombinant polynucleic acid encoding a fusion protein. 

Among preferred variants are those that vary from a reference by conservative amino acid substitutions. 
Such substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of 
like characteristics. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and Ile; interchange of the hydroxyl residues Ser and Thr; 
exchange of the acidic residues Asp and Glu; substitution between the amide residues Asn and Gln; 
exchange of the basic residues Lys and Arg; and replacements among the aromatic residues Phe and Tyr. 
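
The conservative substitution groups listed above lend themselves to a simple lookup. The following Python sketch encodes exactly the six groups named in the text, using standard one-letter amino acid codes, and tests whether a given exchange stays within one group. The function name and the choice to treat an unchanged residue as "not a substitution" are assumptions made for illustration.

```python
# Illustrative sketch only: groups are transcribed from the paragraph above;
# the function name and behaviour for identical residues are assumptions.
CONSERVATIVE_GROUPS = (
    frozenset("AVLI"),  # aliphatic: Ala, Val, Leu, Ile
    frozenset("ST"),    # hydroxyl: Ser, Thr
    frozenset("DE"),    # acidic: Asp, Glu
    frozenset("NQ"),    # amide: Asn, Gln
    frozenset("KR"),    # basic: Lys, Arg
    frozenset("FY"),    # aromatic: Phe, Tyr
)


def is_conservative(original: str, replacement: str) -> bool:
    """True if replacing `original` by `replacement` is a conservative
    substitution according to the six groups listed in the text.

    Identical residues return False because no substitution took place.
    """
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("L", "I"))  # Leu -> Ile, both aliphatic
    print(is_conservative("D", "K"))  # Asp -> Lys, acidic to basic
```

Residues outside the six groups (for example Gly, Pro, Cys, Met, His, Trp) are never reported as conservative by this lookup, which mirrors the "typically seen" wording of the text rather than any substitution matrix.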

Further particularly preferred in this regard are variants, analogs, derivatives and fragments, and 
variants, analogs and derivatives of the fragments, having the amino acid sequence of any polypeptide 
set forth in the Sequence Listing, in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid 
residues are substituted, deleted or added, in any combination. Especially preferred among these are 
silent substitutions, additions and deletions, which do not alter the properties and activities of the 
polypeptide of the present invention. Also especially preferred in this regard are conservative 
substitutions. Most highly preferred are polypeptides having an amino acid sequence set forth in the 
Sequence Listing without substitutions. 

The hyperimmune serum reactive antigens and fragments thereof of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

Also among preferred embodiments of the present invention are polypeptides comprising fragments of 
the polypeptides having the amino acid sequence set forth in the Sequence Listing, and fragments of 
variants and derivatives of the polypeptides set forth in the Sequence Listing. 

In this regard a fragment is a polypeptide having an amino acid sequence that entirely is the same as part 
but not all of the amino acid sequence of the aforementioned hyperimmune serum reactive antigen and 
fragment thereof, and variants or derivatives, analogs, fragments thereof. Such fragments may be "free- 
standing", i.e., not part of or fused to other amino acids or polypeptides, or they may be comprised 
within a larger polypeptide of which they form a part or region. Also preferred in this aspect of the 
invention are fragments characterised by structural or functional attributes of the polypeptide of the 
present invention, i.e. fragments that comprise alpha-helix and alpha-helix forming regions, beta-sheet 
and beta-sheet forming regions, turn and turn-forming regions, coil and coil-forming regions, hydrophilic 
regions, hydrophobic regions, alpha amphipathic regions, beta-amphipathic regions, flexible regions, 
surface-forming regions, substrate binding regions, and high antigenic index regions of the polypeptide 
of the present invention, and combinations of such fragments. Preferred regions are those that mediate 
activities of the hyperimmune serum reactive antigens and fragments thereof of the present invention. 
Most highly preferred in this regard are fragments that have a chemical, biological or other activity of the 
hyperimmune serum reactive antigen and fragments thereof of the present invention, including those 
with a similar activity or an improved activity, or with a decreased undesirable activity. Particularly 
preferred are fragments comprising receptors or domains of enzymes that confer a function essential for 
viability of S. pneumoniae or the ability to cause disease in humans. Further preferred polypeptide 
fragments are those that comprise or contain antigenic or immunogenic determinants in an animal, 
especially in a human. 
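
The definition of a fragment given above (a sequence identical to part, but not all, of a parent amino acid sequence) can be expressed in a few lines. The following Python sketch extracts a fragment by residue positions and checks the "part but not all" condition. It assumes that residue ranges are 1-based and inclusive, which is the usual convention for protein coordinates but is not stated explicitly in this section; the function names and the example peptide are hypothetical.

```python
# Illustrative sketch only: 1-based inclusive coordinates are an assumption,
# and the example sequence is invented for demonstration.

def extract_fragment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of `sequence`."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range must lie within the sequence")
    return sequence[start - 1:end]


def is_fragment(candidate: str, parent: str) -> bool:
    """True if `candidate` is identical to part, but not all, of `parent`."""
    return 0 < len(candidate) < len(parent) and candidate in parent


if __name__ == "__main__":
    parent = "MKTAYIAKQR"          # hypothetical 10-residue polypeptide
    piece = extract_fragment(parent, 4, 8)
    print(piece, is_fragment(piece, parent))
```

The check covers only sequence identity with a contiguous stretch of the parent; whether a fragment retains antigenic or biological activity is an experimental question that code of this kind cannot answer.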

An antigenic fragment is defined as a fragment of the identified antigen which is for itself antigenic or 
may be made antigenic when provided as a hapten. Therefore, also antigens or antigenic fragments 
showing one or (for longer fragments) only a few amino acid exchanges are enabled with the present 
invention, provided that the antigenic capacities of such fragments with amino acid exchanges are not 
severely deteriorated on the exchange(s), i.e., suited for eliciting an appropriate immune response in an 
individual vaccinated with this antigen and identified by individual antibody preparations from 
individual sera. 



Preferred examples of such fragments of a hyperimmune serum-reactive antigen are selected from the 
group consisting of peptides comprising amino acid sequences of column "predicted immunogenic aa", 
and "Location of identified immunogenic region" of Table 1; the serum reactive epitopes of Table 2, 
especially peptides comprising amino acid 4-11, 35-64, 66-76, 101-108, 111-119 and 57-114 of Seq ID No 
145; 5-27, 32-64, 92-102, 107-113, 119-125, 133-139, 148-162, 177-187, 195-201, 207-214, 241-251, 254-269, 
285-300, 302-309, 317-324, 332-357, 365-404, 411-425, 443-463, 470-477, 479-487, 506-512, 515-520, 532-547, 
556-596, 603-610, 616-622, 624-629, 636-642, 646-665, 667-674, 687-692, 708-720, 73^739, 752-757, 798-820, 
824-851, 856-865 and 732-763 of Seq ID No 146; 14-21, 36-44, 49-66, 102-127, 162-167, 177-196, 45-109 and 
145-172 of Seq ID No 147; 17-35, 64-75, 81-92, 100-119, 125-172, 174-183, 214-222, 230-236, 273-282, 287- 
303, 310-315, 331-340, 392-398, 412-420, 480-505, 515-523, 525-546, 553-575, 592-598, 603-609, 617-625, 631- 
639, 644-651, 658-670, 681-687, 691-704, 709-716, 731-736, 739-744, 750-763, 774-780, 784-791, 799-805, 809- 
822, 859-870, 880-885, 907-916, 924-941, 943-949, 973-986, 1010-1016, 1026-1036, 1045-1054, 1057-1062, 1082- 
1088, 1095-1102, 1109-1120, 1127-1134, 1140-1146, 1152-1159, 1169-1179, 1187-1196, 1243-1251, 1262-1273, 
1279-1292, 1306-1312, 1332-1343, 1348-1364, 1379-1390, 1412-1420, 1427-1436, 1458-1468, 1483-1503, 1524- 
1549, 1574-1588, 1614-1619, 1672-1685, 1697-1707, 1711-1720, 1738-1753, 1781-1787, 1796-1801, 1826-1843, 
132^78, 508-592 and 1753-1810 of Seq ID No 148; 15-43, 49-55, 71-77, 104-110, 123-130, 162-171, 180-192, 
199-205, 219-227, 246-254, 264-270, 279-287, 293-308, 312-322, 330-342, 349-356, 369-377, 384-394, 401-406, 
416-422, 432-439, 450-460, 464-474, 482-494, 501-508, 521-529, 536-546, 553-558, 568-574, 584-591, 602-612, 
616-626, 634-646, 653-660, 673-681, 688-698, 705-710, 720-726, 736-749, 833-848, 1-199, 200-337, 418-494 and 
549-647 of Seq ID No 149; 9-30, 65-96, 99-123, 170-178 and 1-128 of Seq ID No 150; 7-32, 34-41, 96-106, 
127-136, 154-163, 188-199, 207-238, 272-279, 306-312, 318-325, 341-347, 353-360, 387-393, 399-406, 434-440, 
452-503, 575-580, 589-601, 615-620, 635-640, 654-660, 674-680, 696-701, 710-731, 1-548 and 660-691 of Seq 
ID No 151; 4-19, 35-44, 48-59, 77-87, 93-99, 106-111, 130-138, 146-161 and 78-84 of Seq ID No 152; 24-30, 
36-43, 64-86, 93-99, 106-130, 132-145, 148-165, 171-177, 189-220, 230-249, 251-263, 293-300, 302-312, 323-329, 
338-356, 369-379, 390-412 and 179-193 of Seq ID No 153; 30-39, 61-67, 74-81, 90-120, 123-145, 154-167, 169- 
179, 182-197, 200-206, 238-244, 267-272 and 230-265 of Seq ID No 154; 14-20, 49-65, 77-86 and 2-68 of Seq 
ID No 155; 4-9, 26-35, 42-48, 53-61, 63-85, 90-101, 105-111, 113-121, 129-137, 140-150, 179-188, 199-226, 228- 
237, 248-255, 259-285, 299-308, 314-331, 337-343, 353-364, 410-421, 436-442 and 110-144 of Seq ID No 156; 
36-47, 55-63, 94-108, 129-134, 144-158, 173-187, 196-206, 209-238, 251-266, 270-285, 290-295, 300-306, 333- 
344, 346-354, 366-397, 404-410, 422-435, 439-453, 466-473, 515-523, 529-543, 554-569, 571-585, 590-596, 607- 
618, 627-643, 690-696, 704-714, 720-728, 741-749, 752-767, 780-799, 225-247 and 480-507 of Seq ID No 157; 
16-25, 36-70, 80-93, 100-106 and 78-130 of Seq ID No 158; 18-27, 41-46, 50-57, 65-71, 79-85, 93-98, 113-128, 
144-155, 166-178, 181-188, 201-207, 242-262, 265-273, 281-295, 303-309, 318-327 and 36-64 of Seq ID No 159; 
7-29, 31-44, 50-59, 91-96, 146-153, 194-201, 207-212, 232-238, 264-278, 284-290, 296-302, 326-353, 360-370, 
378-384, 400-405, 409-418, 420-435, 442-460, 499-506, 529-534, 556-562, 564-576, 644-651, 677-684, 687-698, 
736-743, 759-766, 778-784, 808-814, 852-858, 874-896, 920-925, 929-935, 957-965, 1003-1012, 1021-1027, 1030- 
1044, 1081-1087, 1101-1111, 1116-1124, 1148-1159, 1188-1196, 1235-1251, 1288-1303, 1313-1319, 1328-1335, 
1367-1373, 1431-1437, 1451-1458, 1479-1503, 1514-1521, 1530-1540, 1545-1552, 1561-1568, 1598-1605, 1617- 
1647, 1658-1665, 1670-1676, 1679-1689, 1698-1704, 1707-1713, 1732-1738, 1744-1764, 1-70, 154-189, 922-941, 
1445-1462 and 1483-1496 of Seq ID No 160; 6-51, 81-91, 104-113, 126-137, 160-159, 164-174, 197-209, 215- 
224, 229-235, 256-269, 276-282, 307-313, 317-348, 351-357, 376-397, 418-437, 454-464, 485-490, 498-509, 547- 
555, 574-586, 602-619 and 452-530 of Seq ID No 161; 25-31, 39-47, 49-56, 99-114, 121-127, 159-186, 228-240, 
253-269, 271-279, 303-315, 365-382, 395-405, 414-425, 438-453 and 289-384 of Seq ID No 162; 9-24, 41-47, 
49-54, eS-lB, 108-114, 117-122, 132-140, 164-169, 179-186, 193-199, 206-213, 244-251, 267-274, 289-294, 309- 
314, 327-333, 209-249 and 286-336 of Seq ID No 163; 9-28, 53-67, 69-82, 87-93, 109-117, 172-177, 201-207, 
220-227, 242-247, 262-268, 305-318, 320-325 and 286-306 of Seq ID No 164; 4-10, 26-39, 47-58, 63-73, 86-96, 
98-108, 115-123, 137-143, 148-155, 160-176, 184-189, 194-204, 235-240, 254-259, 272-278 and 199-283 of Seq 
ID No 165; 4-26, 33-39, 47-53, 59-65, 76-83, 91-97, 104-112, 118-137, 155-160, 167-174, 198-207, 242-268, 273- 
279, 292-315, 320^2, 345-354, 358-367, 377-394, 403-410, 424-439, 445-451, 453-497, 511-518, 535-570, 573- 
589, 592-601, 604-610 and 202-242 of Seq ID No 166; 8-30, 36-45, 64-71, 76-82, 97-103, 105-112, 134-151, 
161-183, 211-234, 253-268, 270-276, 278-284, 297-305, 309-315, 357-362, 366-372, 375-384, 401-407, 409-416, 
441-455, 463-470, 475-480, 490-497, 501-513, 524-537, 552-559, 565-576, 581-590, 592-600, 619-625, 63S-644, 
646-656 and 316-419 of Seq ID No 167; 4-17, 52-58, 84-99, 102-110, 114-120, 124-135, 143-158, 160-173, 177- 
196, 201-216, 223-250, 259-267, 269-275 and 1-67 of Seq ID No 168; 6-46, 57-67, 69-80, 82-133, 137-143, 147- 
168, 182-187, 203-209, 214-229, 233-242, 246-280 and 53-93 of Seq ID No 169; 7-40, 50-56, 81-89, 117-123, 
202-209, 213-218, 223-229, 248-261, 264-276, 281-288, 303-308, 313-324, 326-332, 340-346, 353-372, 434-443, 
465-474, 514-523, 556-564, 605-616, 620-626, 631-636, 667-683, 685-699, 710-719, 726-732, 751-756, 760-771, 
779-7m, 815-828, 855-867, 869-879, 897-902, 917-924, 926-931, 936-942, 981-1000, 1006-1015, 1017-1028, 
1030-1039, 1046-1054, 1060-1066, 1083-1092, 1099-1112, 1122-1130, 1132-1140, 1148-1158, 1161-1171, 1174- 
1181, 1209-1230, 1236-1244, 1248-1254, 1256-1267, 1269-1276, 1294-1299, 1316-1328, 1332-1354, 1359-1372, 
1374-1380, 1384-1390, 1395-1408, 1419-1425, 1434-1446, 1453-1460, 1465-1471, 1474-1493, 1505-1515, 1523- 
1537, 1547-1555, 1560-1567, 1577-1605, 1633-1651, 1226-1309, 1455-1536 and 1538-1605 of Seq ID No 170; 

4-10, 31-39, 81-88, 106-112, 122-135, 152-158, 177-184, 191-197, 221-227, 230-246, 249-255, 303-311, 317-326, 
337-344, 346-362, 365-371, 430-437, 439-446, 453-462, 474-484 and 449-467 of Seq ID No 171; 9-15, 24-35, 
47-55, 122-128, 160-177, 188-196, 202-208, 216-228, 250-261, 25^-303, 318-324, 327-339, 346-352, 355-361, 
368-373, 108-218 and 344-376 of Seq ID No 172; 6-14, 17-48, 55-63, 71-90, 99-109, 116-124, 181-189, 212-223, 
232-268, 270-294, 297-304, 319-325, 340-348, 351-370, 372-378, 388-394, 406-415, 421-434 and 177-277 of Seq 
ID No 173; 21-39, 42-61, 65-75, 79-85, 108-115 and 11-38 of Seq ID No 174; 4-17, 26-39, 61-76, 103-113, 115- 
122, 136-142, 158-192, 197-203, 208-214, 225-230, 237-251 and 207-225 of Seq ID No 175; 5-11, 27-36, 42-53, 
62-70, 74-93, 95-104, 114-119, 127-150, 153-159, 173-179, 184-193, 199-206, 222-241, 248-253, 257-280, 289- 
295, 313-319, 322-342, 349-365, 368-389, 393-406, 408-413, 426-438, 447-461, 463-470, 476-495, 532-537, 543- 
550 and 225-246 of Seq ID No 176; 4-29, 68-82, 123-130, 141-147, 149-157, 178-191, 203-215, 269-277, 300- 
307, 327-335, 359-370, 374-380, 382-388, 393-400, 410-417, 434-442, 483-492, 497-503, 505-513, 533-540, 564- 
569, 601-607, 639-647, 655-666, 693-706, 712-718, 726-736, 752-758, 763-771, 774-780, 786-799, 806-812, 820- 
828, 852-863, 884-892, 901-909, 925-932, 943-948, 990-996, 1030-1036, 1051-1059, 1062-1068, 1079-1086, 1105- 
1113, 1152-1162, 1168-1179, 1183-1191, 1204-1210, 1234-1244, 1286-1295, 1318-1326, 1396-1401, 1451-1460, 
1465-1474, 1477-1483, 1488-1494, 1505-1510, 1514-1521, 1552-1565, 1593-1614, 1664-1672, 1677-1685, 1701- 
1711, 1734-1745, 1758-1770, 1784-1798, 1840-1847, 1852-1873, 1885-1891, 1906-1911, 1931-1939, 1957-1970, 
1977-1992, 2014-2020, 2026-2032, 2116-2134, 1-348, 373-490, 573-767, 903-1043, 1155-1198, 1243-1482, 1550- 
1595, 1682-1719, 1793-1921 and 2008-2110 of Seq ID No 177; 10-35, 39-52, 107-112, 181-188, 226-236, 238- 
253, 258-268, 275-284, 296-310, 326-338, 345-368, 380-389, 391-408, 410-418, 420^9, 444-456, 489-505, 573- 
588, 616-623, 637-643, 726-739, 741-767, 785-791, 793-803, 830-847, 867-881, 886-922, 949-956, 961-980, 988- 
1004, 1009-1018, 1027-1042, 1051-1069, 1076-1089, 1108-1115, 1123-1135, 1140-1151, 1164-1179, 1182-1191, 
1210-1221, 1223-1234, 1242-1250, 1255-1267, 1281-1292, 1301-1307, 1315-1340, 1348-1355, 1366-1373, 1381- 
1413, 1417-1428, 1437-1444, 1453-1463, 1478-1484, 1490-1496, 1498-1503, 1520-1536, 1538-1546, 1548-1570, 
1593-1603, 1612-1625, 1635-1649, 1654-1660, 1670-1687, 1693-1700, 1705-1711, 1718-1726, 1729-1763, 1790- 
1813, 1871-1881, 1893-1900, 1907-1935, 1962-1970, 1992-2000, 2006-2013, 2033-2039, 2045-2051, 2055-2067, 
2070-2095, 2097-2110, 2115-2121, 2150-2171, 2174-2180, 2197-2202, 2206-2228 and 1526-1560 of Seq ID No 
178; 4-17, 35-48, 54-76, 78-107, 109-115, 118-127, 134-140, 145-156, 169-174, 217-226, 232-240, 256-262, 267- 
273, 316-328, 340-346, 353-360, 402-409, 416-439, 448-456, 506-531, 540-546, 570-578, 586-593, 595-600, 623- 
632, 662-667, 674-681, 689-705, 713-724, 730-740, 757-763, 77^-77%, 783-796, 829-835, 861-871, 888-899, 907- 
939, 941-955, 957-969, 986-1000, 1022-1028, 1036-1044, 1068-1084, 1095-1102, 1118-1124, 1140-1146, 1148- 
1154, 1168-1181, 1185-1190, 1197-1207, 1218-1226, 1250-1270, 1272-1281, 1284-1296, 1312-1319, 1351-1358, 
1383-1409, 1422-1428, 1438-1447, 1449-1461, 1482-1489, 1504-1510, 1518-1527, 1529-1537, 1544-1551, 1569- 
1575, 1622-1628, 1631-1637, 1682-1689, 1711-1718, 1733-1740, 1772-1783, 1818-1834, 1859-1872, 1-64 and 
128-495 of Seq ID No 179; 8-28, 32-37, 62-69, 119-125, 137-149, 159-164, 173-189, 200-205, 221-229, 240-245, 
258-265, 268-276, 287-293, 296-302, 323-329 and 1-95 of Seq ID No 180; 9-18, 25-38, 49-63, 65-72, 74-81, 94- 
117, 131-137, 139-146, 149-158, 162-188, 191-207, 217-225, 237-252, 255-269, 281-293, 301-326, 332-342, 347- 
354, 363-370, 373-380, 391-400, 415-424, 441-447 and 75-107 of Seq ID No 181; 4-24, 64-71, 81-87, 96-116, 
121-128, 130-139, 148-155, 166-173, 176-184, 203-215, 231-238, 243-248, 256-261, 280-286, 288-306, 314-329 
and 67-148 of Seq ID No 182; 4-10, 19-37, 46-52, 62-81, 83-89, 115-120, 134-139, 141-151, 168-186, 197-205, 
209-234, 241-252, 322-335, 339-345, 363-379, 385-393, 403-431, 434-442, 447-454, 459-465, 479-484, 487-496 
and 404-420 of Seq ID No 183; 10-35, 46-66, 71-77, 84-93, 96-122, 138-148, 154-172, 182-213, 221-233, 245- 
263, 269-275, 295-301, 303-309, 311-320, 324-336, 340-348, 351-359, 375-381 and 111-198 of Seq ID No 184; 
14-25, 30-42, 47-61, 67-75, 81-91, 98-106, 114-122, 124-135, 148-193, 209-227 and 198-213 of Seq ID No 185; 

5-18, 45-50, 82-90, 97-114, 116-136, 153-161, 163-171, 212-219, 221-227, 240-249, 267-281, 311-317, 328-337, 
375-381, 390-395, 430-436, 449-455, 484-495, 538-543, 548-554, 556-564, 580-586, 596-602 and 493-606 of Seq 
ID No 186; 9-25, 28-3^ 37-44, 61-68, 75-81, 88-96, 98-111, 119-133, 138-150, 152-163, 168-182, 186-194, 200- 
205, 216-223, 236-245, 257-264, 279-287, 293-304, 311-318, 325-330, 340-346, 353-358, 365-379, 399-409, 444- 
453 and 303-391 of Seq ID No 187; 16-36, 55-61, 66-76, 78-102, 121-130, 134-146, 150-212, 221-239, 255-276, 
289-322, 329-357 and 29-59 of Seq ID No 188; 8-27, 68-74, 77-99, 110-116, 124-141, 171-177, 202-217, 221- 
228, 259-265, 275-290, 293-303, 309-325, 335-343, 345-351, 365-379, 384-394, 406-414, 423-437, 452-465, 478- 
507, 525-534, 554-560, 611-624, 628-651, 669-682, 742-747, 767-778, 782-792, 804-812, 820-836, 79-231 and 
359-451 of Seq ID No 189; 5-28, 39-45, 56-62, 67-74, 77-99, 110-117, 124-141, 168-176, 200-230, 237-244, 268- 
279, 287-299, 304-326, 329-335, 348-362, 370-376, 379-384, 390-406, 420-429, 466-471, 479-489, 495-504, 529- 
541, 545-553, 561-577, 598-604, 622-630, 637-658, 672-680, 682-688, 690-696, 698-709, 712-719, 724-736, 738- 
746, 759-769, 780-786, 796-804, 813-818, 860-877, 895-904, 981-997, 1000-1014, 1021-1029, 1-162, 206-224, 
254-350, 414-514 and 864-938 of Seq ID No 190; 4-11, 19-49, 56-66, 68-101, 109-116, 123-145, 156-165, 177- 
185, 204-221, 226-234, 242-248, 251-256, 259-265, 282-302, 307-330, 340-349, 355-374, 377-383, 392-400, 422- 
428, 434-442, 462-474 and 266-322 of Seq ID No 191; 14-43, ^-57, €Ar7^, 80-87, 106-127, 131-142, 145-161, 
173-180, 182-188, 203-210, 213-219, 221-243, 245-254, 304-311, 314-320, 342-348, 354-365, 3:^-378, 394-399, 
407-431, 436-448, 459-465, 470-477, 484-490, 504-509, 531-537, 590-596, 611-617, 642-647, 723-734, 740-751, 
754-762, 764-774, 782-797, 807-812, 824-831, 838-845, 877-885, 892-898, 900-906, 924-935, 940-946, 982-996, 
1006-1016, 1033-1043, 1051-1056, 1058-1066, 1094-1108, 1119-1126, 1129-1140, 1150-1157, 1167-1174, 1176- 
1185, 1188-1201, 1209-1216, 1220-1228, 1231-1237, 1243-1248, 1253-1285, 1288-1297, 1299-1307, 1316-1334, 
1336-1343, 1350-1359, 1365-1381, 1390-1396, 1412-1420, 1427-1439, 1452-1459, 1477-1484, 1493-1512, 1554- 
1559, 1570-1578, 1603-1608, 1623-1630, 1654-1659, 1672-1680, 1689-1696, 1705-1711, 1721-1738, 1752-1757, 
1773-1780, 1817-1829, 1844-1851, 1856-1863, 1883-1895, 1950-1958, 1974-1990, 172-354, 384-448, 464-644, 
648-728 and 1357-1370 of Seq ID No 192; 8-27, 68-74, 77-99, 110-116, 124-141, 169-176, 201-216, 220-227, 
258-264, 274-289, 292-302, 308-324, 334-342, 344-350, 364-372, 377-387, 399-407, 416-429, 445-458, 471-481, 
483-500, 518-527, 547-553, 604-617, 621-644, 662-675, 767-778, 809-816, 15-307, 350-448 and 496-620 of Seq 
ID No 193; 4-17, 24-29, 53-59, 62-84, 109-126, 159-164, 189-204, 208-219, 244-249, 274-290, 292-302, 308-324, 
334-342, 344-350, 378-389, 391-397, 401-409, 424-432, 447-460, 470-479, 490-504, 521-529, 538-544, 549-555, 
570-577, 583-592, 602-608, 615-630, 635-647, 664-677, 692-698, 722-731, 733-751, 782-790, 793-799, 56-267, 
337-426 and 495-601 of Seq ID No 194; 12-22, 49-59, 77-89, 111-121, 136-148, 177-186, 207-213, 217-225, 
227-253, 259-274, 296-302, 328-333, 343-354, 374-383, 424-446, 448-457, 468-480, 488-502, 507-522, 544-550, 
553-560, 564-572, 587-596, 604-614, 619-625, 629-635, 638-656, 662-676, 680-692, 697-713, 720-738, 779-786, 
833^7, 861-869, 880-895, 897-902, 911-917, 946-951, 959-967, 984-990, 992-1004, 1021-1040, 1057-1067, 
1073-1080 and 381-403 of Seq ID No 195; 4-10, 26-31, 46-56, 60-66, 70-79, 86-94, 96-102, 109-118, 132-152, 
164-187, 193-206, 217-224 and 81-149 of Seq ID No 196; 4-21, 26-37, 48-60, 71-82, 109-117, 120-128, 130-136, 
142-147, 181-187, 203-211, 216-223, 247-255, 257-284, 316-325, 373-379, 395-400, 423-435, 448-456, 479-489, 
512-576, 596-625, 641-678, 680-688, 692-715 and 346-453 of Seq ID No 197; 10-16, 25-31, 34-56, 58-69, 71-89, 
94-110, 133-176, 186-193, 208-225, 240-250, 259-266, 302-307, 335-341, 376-383, 410-416 and 316-407 of Seq 
ID No 198; 11-29, 42-56, 60-75, 82-88, 95-110, 116-126, 132-143, 145-160, 166-172, 184-216 and 123-164 of 
Seq ID No 199; 11-29, 54-63, 110-117, 139-152, 158-166, 172-180, 186-193, 215-236, 240-251, 302-323, 330- 
335, 340-347, 350-366, 374-381 and 252-299 of Seq ID No 200; 18-27, 35-42, 50-56, 67-74, 112-136, 141-153, 
163-171, 176-189, 205-213, 225-234, 241-247, 253-258, 269-281, 288-298, 306-324, 326-334, 355-369, 380-387 
and 289-320 of Seq ID No 201; 7-15, 19-41, 56-72, 91-112, 114-122, 139-147, 163-183, 196-209, 258-280, 326- 
338, 357-363, 391-403, 406-416 and 360-378 of Seq ID No 202; 11-18, 29-41, 43-49, 95-108, 142-194, 204-212, 
216-242, 247-256, 264-273 and 136-149 of Seq ID No 203; 18-24, 33-40, 65-79, 89-102, 113-119, 130-137, 155- 
161, 173-179, 183-203, 205-219, 223-231, 245-261, 267-274, 296-306, 311-321, 330-341, 344-363, 369-381, 401- 
408, 415-427, 437-444, 453-464, 472-478, 484-508, 517-524, 526-532, 543-548 and 59-180 of Seq ID No 204; 5- 
13, 52-65, 67-73, 97-110, 112-119, 134-155 and 45-177 of Seq ID No 205; 6-28, 34-43, 57-67, 75-81, 111-128, 
132-147, 155-163, 165-176, 184-194, 208-216, 218-229, 239-252, 271-278, 328-334, 363-376, 381-388, 426-473, 
481-488, 492-498, 507-513, 536-546, 564-582, 590-601, 607-623, 148-269, 420-450 and 610-648 of Seq ID No 
206; 4-12, 20-38, 69-75, 83-88, 123-128, 145-152, 154-161, 183-188, 200-213, 245-250, 266-272, 306-312, 332- 
339, 357-369, 383-389, 395-402, 437-453, 45S470, 497-503 and 1-112 of Seq ID No 207; 35-59, 74-86, 111-117, 
122-137 and 70-154 of Seq ID No 208; 26-42, 54-61, 65-75, 101-107, 123-130, 137-144, 148-156, 164-172, 177- 
192, 213-221, 231-258 and 157-249 of Seq ID No 209; 29-38, 61-67, 77-87, 94-100, 105-111, 118-158 and 1-97 
of Seq ID No 210; 7-21, 30-48, 51-58, 60-85, 94-123, 134-156, 160-167, 169-183, 186-191, 216-229, 237-251, 
IBl-Qhl, 272'7B2, 287-298 and 220-243 of Seq ID No 211; 6-29, 34-47, 56-65, 69-76, 83-90, 123-134, 143-151, 
158-178, 197-203, 217-235, 243-263, 303-3(», 320-333, 338-348, 367-373, 387-393, 407-414, 416-427, 441-457, 
473-4ffi, 487-499, 501-509, 514-520, 530-535, 577-583, 590-602, 605-612, 622-629, 641-670, 678-690, 37-71 and 
238-307 of Seq ID No 212; 7-40, 121-132, 148-161, 196-202, 209-215, 221-235, 248-255, 271-280, 288-295, 330- 

339, 395-409, 414-420, 446-451, 475-487, 556-563, 568-575, 580-586, 588-595, 633-638, 643-648, 652-659, 672- 
685, 695-700, 710-716, 737-742, 749-754, 761-767, 775-781, 796-806, 823-835, 850-863, 884-890, 892-900, 902- 
915, 934-941 and 406-521 of Seq ID No 213; 9-18, 24-46, 51-58, 67-77, 85-108, 114-126, 129-137, 139-146, 
152-165, 173-182, 188-195, 197-204, 217-250, 260-274, 296-313, 343-366, 368-384, 427-434, 437-446, 449-455, 
478-484, 492-506, 522-527, 562-591, 599-606, 609-618, 625-631, 645-652 and 577-654 of Seq ID No 214; 13-20, 
26-37, 41-53, 56-65, 81-100, 102-114, 118-127, 163-188, 196-202, 231-238, 245-252, 266-285, 293-298, 301-306 
and 19-78 of Seq ID No 215; 10-23, 32-42, 54-66, 73-91, 106-113, 118-127, 139-152, 164-173, 198-207, 210- 
245, 284-300, 313-318, 330-337, 339-346, 354-361, 387-393, 404-426, 429-439, 441-453, 467-473, 479-485, 496- 
509, 536-544, 551-558, 560-566, 569-574, 578-588, 610-615, 627-635, 649-675, 679-690, 698-716, 722-734, 743- 
754, 769-780, 782-787 and 480-550 of Seq ID No 216; 6-39, 42-50, 60-68, 76-83, 114-129, 147-162, 170-189, 
197-205, 217-231, 239-248, 299-305, 338-34^ 352-357, 371-377, 380-451, 459-483, 491-499, 507-523, 537-559, 
587-613, 625-681, 689-729, 737-781, 785-809, 817-865, 873-881, 889-939, 951-975, 983-1027, 1031-1055, 1063- 
1071, 1079-1099, 1103-1127, 1151-1185, 1197-1261, 1269-1309, 1317-1333, 1341-1349, 1357-1465, 1469-1513, 
1517-1553, 1557-1629, 1637-1669, 1677-1701, 1709-1725, 1733-1795, 1823-1849, 1861-1925, 1933-1973, 1981- 
2025, 2029-2053, 2061-2109, 2117-2125, 2133-2183, 2195-2219, 2227-2271, 2275-2299, 2307-2315, 2323-2343, 
2347-2371, 2395-2429, 2441-2529, 2537-2569, 2577-2601, 2609-2625, 2633-2695, 2699-2737, 2765-2791, 2803- 
2867, 2889-2913, 2921-2937, 2945-2969, 2977-2985, 2993-3009, 3023-3045, 3073-3099, 3111-3167, 3175-3215, 
3223-3267, 3271-3295, 3303-3351, 3359-3367, 3375-3425, 3437-3461, 3469-3513, 3517-3541, 3549-3557, 3565- 
3585, 3589-3613, 3637-3671, 3683-3747, 3755-3795, 3803-3819, 3827-3835, 3843-3951, 3955-3999, 4003-4039, 
4043-4115, 4123-4143, 4147-4171, 4195-4229, 4241-4305, 4313-4353, 4361-4377, 4385-4393, 4401-4509, 4513- 
4557, 4561-4597, 4601-4718, 4749-4768, 74-171, 452-559 and 2951-3061 of Seq ID No 217; 16-22, 30-51, 70- 
111, 117-130, 137-150, 171-178, 180-188, 191-196 and 148-181 of Seq ID No 218; 6-19, 21-46, 50-56, 80-86, 
118-126, 167-186, 189-205, 211-242, 244-267, 273-286, 290-297, 307-316, 320-341 and 34-60 of Seq ID No 219; 
5-26, 33-43, 48-54, 58-63, 78-83, 113-120, 122-128, 143-152, 157-175, 185-192, 211-225, 227-234, 244-256, 270- 
281, 284-290, 304-310, 330-337, 348-355, 362-379, 384-394, 429-445, 450-474, 483-490, 511-520, 537-546, 548- 
554, 561-586, 590-604, 613-629, 149-186, 285-431 and 573-659 of Seq ID No 220; 5-26, 49-59, 61-67, 83-91, 
102-111, 145-157, 185-192, 267-272, 279-286, 292-298, 306-312, 134-220, 235-251 and 254-280 of Seq ID No 
221; 5-19, 72-79, 83-92, 119-124, 140-145, 160-165, 167-182, 224-232, 240-252, 259-270, 301-310, 313-322, 332- 
343, 347-367, 384-398, 416-429, 431-446, 454-461 and 1-169 of Seq ID No 222; 8-17, 26-31, 56-62, 75-83, 93- 
103, 125-131, 135-141, 150-194, 205-217, 233-258, 262-268, 281-286 and 127-168 of Seq ID No 223; 6-12, 69- 
75, 108-115, 139-159, 176-182, 194-214 and 46-161 of Seq ID No 224; 6-13, 18-27, 39-48, 51-59, 66-73, 79-85, 
95-101, 109-116, 118-124, 144-164, 166-177, 183-193, 197-204, 215-223, 227-236, 242-249, 252-259, 261-270, 
289-301, 318-325 and 12-58 of Seq ID No 225; 4-10, 26-32, 48-60, 97-105, 117-132, 138-163, 169-185, 192-214, 
219-231, 249-261, 264-270, 292-308, 343-356, 385-392, 398-404, 408-417, 435-441 and 24-50 of Seq ID No 226; 
10-40, 42-48, 51-61, 119-126 and 1-118 of Seq ID No 227; 5-17, 40-58, 71-83, 103-111, 123-140, 167-177, 188- 
204 and 116-128 of Seq ID No 228; 4-9, 11-50, 57-70, 112-123, 127-138 and 64-107 of Seq ID No 229; 9-39, 
51-67 and 1-101 of Seq ID No 230; 5-14, 17-25, 28-46, 52-59, 85-93, 99-104, 111-120, 122-131, 140-148, 158- 
179, 187-197, 204-225, 271-283, 285-293 and 139-155 of Seq ID No 231; 42-70, 73-90, 92-108, 112-127, 152- 
164, 166-172, 181-199, 201-210, 219-228, 247-274, 295-302, 322-334, 336-346, 353-358, 396-414, 419-425, 432- 
438, 462-471, 518-523, 531-536, 561-567, 576-589, 594-612, 620-631, 665-671, 697-710, 718-731, 736-756, 765- 
771, 784-801 and 626-653 of Seq ID No 232; 8-28, 41-51, 53-62, 68-74, 79-85, 94-100, 102-108, 114-120, 130-
154, 156-162, 175-180, 198-204, 206-213, 281-294, 308-318, 321-339, 362-368, 381-386, 393-399, 407-415 and 2-
13 of Seq ID No 233; 4-39, 48-65, 93-98, 106-112, 116-129 and 10-36 of Seq ID No 234; 25-32, 35-50, 66-71, 
75-86, 90-96, 123-136, 141-151, 160-179, 190-196, 209-215, 222-228, 235-242, 257-263, 270-280 and 209-247 of 
Seq ID No 235; 5-29, 31-38, 50-57, 62-75, 83-110, 115-132, 168-195, 197-206, 216-242, 249-258, 262-269, 333-
340, 342-350, 363-368, 376-392, 400-406, 410-421, 423-430, 436-442, 448-454, 460-466, 471-476, 491-496, 511-
516, 531-536, 551-556, 571-576, 585-591, 599-605, 27-70, 219-293, 441-504 and 512-584 of Seq ID No 236; 4-
12, 14-34, 47-75, 83-104, 107-115, 133-140, 148-185, 187-196, 207-212, 224-256, 258-265, 281-287, 289-296, 
298-308, 325-333, 345-355, 365-371, 382-395, 424-435, 441-457, 465-472, 483-491, 493-505, 528-534, 536-546, 
552-558, 575-584, 589-600, 616-623 and 576-591 of Seq ID No 237; 4-76, 78-89, 91-126, 142-148, 151-191, 
195-208, 211-223, 226-240, 256-277, 279-285, 290-314, 317-323, 358-377, 381-387, 391-396, 398-411, 415- 
434, 436-446, 454-484, 494-512, 516-523, 538-552, 559-566, 571-577, 579-596, 599-615, 620-627, 635-644, 
694-707, 720-734, 737-759, 761-771 and 313-329 of Seq ID No 238; 7-38, 44-49, 79-89, 99-108, 117-123, 125- 
132, 137-146, 178-187, 207-237, 245-255, 322-337, 365-387, 398-408, 445-462, 603-608, 623-628, 644-650, 657- 
671, 673-679 and 111-566 of Seq ID No 239; 6-20, 22-35, 39-45, 58-64, 77-117, 137-144, 158-163, 205-210, 
218-224, 229-236, 239-251, 263-277, 299-307, 323-334, 353-384, 388-396, 399-438, 443-448, 458-463, 467-478, 
481-495, 503-509, 511-526, 559-576, 595-600, 612-645, 711-721, 723-738, 744-758, 778-807 and 686-720 of Seq 
ID No 240; 10-33, 35-41, 72-84, 129-138, 158-163, 203-226, 243-252, 258-264, 279-302, 322-329, 381-386, 401- 
406, 414-435 and 184-385 of Seq ID No 241; 4-9, 19-24, 41-47, 75-85, 105-110, 113-146 and 45-62 of Seq ID 
No 242; 4-25, 52-67, 117-124, 131-146, 173-180, 182-191, 195-206, 215-221, 229-236, 245-252, 258-279, 286- 
291, 293-302, 314-320, 327-336, 341-353, 355-361, 383-389 and 1-285 of Seq ID No 243; 14-32, 38-50, 73-84, 
93-105, 109-114 and 40-70 of Seq ID No 244; 5-26 and 22-34 of Seq ID No 245; 23-28 and 13-39 of Seq ID 
No 246; 8-14 and 21-34 of Seq ID No 247; 4-13, 20-29, 44-50, 59-74 and 41-69 of Seq ID No 248; 4-9, 19-42, 
48-59, 71-83 and 57-91 of Seq ID No 249; 4-14 and 10-28 of Seq ID No 250; 22-28, 32-42, 63-71, 81-111, 
149-156, 158-167, 172-180, 182-203, 219-229 and 27-49 of Seq ID No 251; 17-27 and 23-32 of Seq ID No 252; 
18-24 and 28-38 of Seq ID No 253; 9-15 and 13-27 of Seq ID No 254; 13-22 and 18-29 of Seq ID No 255; 
17-26 and 2-11 of Seq ID No 256; 4-33 and 16-32 of Seq ID No 257; 4-10, 37-43, 54-84, 92-127 and 15-62 of 
Seq ID No 258; 4-14, 20-32, 35-60, 69-75, 79-99, 101-109, 116-140 and 124-136 of Seq ID No 259; 2-13 of 
Seq ID No 260; 4-13, 28-42 and 42-57 of Seq ID No 261; 4-14, 27-44 and 14-35 of Seq ID No 262; 4-12 and 
1-27 of Seq ID No 263; 4-18, 39-45, 47-74 and 35-66 of Seq ID No 264; 8-20, 43-77 and 17-36 of Seq ID No 
265; 4-30, 35-45, 51-57 and 35-49 of Seq ID No 266; 4-24, 49-57 and 15-34 of Seq ID No 267; 4-22 and 8-27 
of Seq ID No 268; 13-25, 32-59, 66-80 and 21-55 of Seq ID No 269; 4-10, 24-33, 35-42, 54-65, 72-82, 98-108 
and 15-30 of Seq ID No 270; 8-19 and 17-47 of Seq ID No 271; 12-18, 40-46 and 31-52 of Seq ID No 272; 4- 
20, 35-78, 83-102, 109-122 and 74-86 of Seq ID No 273; 7-17, 21-41, 46-63 and 2-20 of Seq ID No 274; 30-37 
and 2-33 of Seq ID No 275; 4-13, 17-25 and 1-15 of Seq ID No 276; 17-31, 44-51 and 20-51 of Seq ID No 
277; 20-30 and 5-23 of Seq ID No 278; 13-33, 48-71 and 92-110 of Seq ID No 279; 4-9, 50-69, 76-88, 96-106, 
113-118 and 12-34 of Seq ID No 280; 4-24 and 6-26 of Seq ID No 281; 7-26 and 14-30 of Seq ID No 282; 9- 
39, 46-68, 75-82, 84-103 and 26-44 of Seq ID No 283; 4-30, 33-107 and 58-84 of Seq ID No 284; 4-12 and 9- 
51 of Seq ID No 285; 12-18, 29-37 and 6-37 of Seq ID No 286; 4-21, 33-52, 64-71 and 16-37 of Seq ID No 
287; 9-19 and 2-30 of Seq ID No 288; 20-37 of Seq ID No 245; 8-27 of Seq ID No 246; 10-27 of Seq ID No 
247; 42-59 and 52-69 of Seq ID No 248; 63-80 and 74-91 of Seq ID No 249; 11-28 of Seq ID No 250; 28-49 
of Seq ID No 251; 15-32 of Seq ID No 252; 4-20 of Seq ID No 253; 10-27 of Seq ID No 254; 17-34 of Seq 
ID No 255; 1-18 of Seq ID No 256; 16-33 of Seq ID No 257; 16-36, 30-49 and 43-62 of Seq ID No 258; 122- 
139 of Seq ID No 259; 1-18 of Seq ID No 260; 41-58 of Seq ID No 261; 15-35 of Seq ID No 262; 2-27 of Seq 
ID No 263; 18-36 of Seq ID No 265; 34-51 of Seq ID No 266; 9-27 of Seq ID No 268; 22-47 of Seq ID No 
269; 18-36 and 29-47 of Seq ID No 271; 32-52 of Seq ID No 272; 72-89 of Seq ID No 273; 3-20 of Seq ID 
No 274; 3-21 and 15-33 of Seq ID No 275; 1-18 of Seq ID No 276; 6-23 of Seq ID No 278; 93-110 of Seq ID 
No 279; 13-34 of Seq ID No 280; 7-26 and 9-26 of Seq ID No 281; 16-33 of Seq ID No 282; 27-44 of Seq ID 
No 283; 67-84 of Seq ID No 284; 10-33 and 26-50 of Seq ID No 285; 7-25 and 19-37 of Seq ID No 286; 17- 
37 of Seq ID No 287; 3-20 and 13-30 of Seq ID No 288; 62-80 and 75-93 of Seq ID No 145; 92-108 of Seq 
ID No 147; 332-349, 177-200 and 1755-1777 of Seq ID No 148; 109-133, 149-174, 260-285 and 460-485 of 
Seq ID No 149; 26-47 and 42-64 of Seq ID No 150; 22-41, 35-54, 115-130, 306-325, 401-420 and 454-478 of 
Seq ID No 151; 22-45 of Seq ID No 155; 156-174, 924-940, 1485-1496, 1447-1462 and 1483-1498 of Seq ID 
No 160; 457-475 of Seq ID No 161; 302-325 of Seq ID No 163; 288-305 of Seq ID No 164; 244-266 and 260- 
282 of Seq ID No 165; 204-225 and 220-241 of Seq ID No 166; 324-345, 340-361, 356-377, 372-393 and 388- 
408 of Seq ID No 167; 39-64 of Seq ID No 168; 54-76 and 70-92 of Seq ID No 169; 1227-1247, 1539-1559, 
1554-1574, 1569-1589, 1584-1604, 1242-1262, 1272-1292, 1287-1308, 1456-1477, 1472-1494, 1488-1510 and 
1505-1526 of Seq ID No 170; 351-368 of Seq ID No 172; 179-200, 195-216, 211-232, 227-248 and 243-263 of 
Seq ID No 173; 13-37 of Seq ID No 174; 208-224 of Seq ID No 175; 42-64, 59-81, 304-328, 323-348, 465-489, 
968-992, 1399-1418, 1412-1431 and 2092-2111 of Seq ID No 177; 1528-1547 and 1541-1560 of Seq ID No 
178; 184-200, 367-388, 382-403, 409-429, 425-444 and 438-457 of Seq ID No 179; 27-50 and 45-67 of Seq ID 
No 180; 114-131 and 405-419 of Seq ID No 183; 113-134, 129-150, 145-166, 161-182 and 177-198 of Seq ID 
No 184; 495-515 of Seq ID No 186; 346-358 of Seq ID No 187; 208-224 of Seq ID No 190; 178-194, 202-223, 
217-238, 288-308 and 1355-1372 of Seq ID No 192; 57-78 of Seq ID No 194; 347-369, 364-386, 381-403, 398-
420, 415-437 and 432-452 of Seq ID No 197; 347-372 of Seq ID No 198; 147-163 of Seq ID No 199; 263-288 
of Seq ID No 200; 361-377 of Seq ID No 202; 82-104, 99-121, 116-138, 133-155 and 150-171 of Seq ID No 
204; 110-130 and 125-145 of Seq ID No 205; 613-631, 626-644 and 196-213 of Seq ID No 206; 78-100, 95- 
117, 112-134 and 129-151 of Seq ID No 208; 158-180, 175-197, 192-214, 209-231 and 226-248 of Seq ID No 
209; 30-50, 45-65 and 60-79 of Seq ID No 210; 431-455 and 450-474 of Seq ID No 213; 579-601, 596-618, 
613-635 and 630-653 of Seq ID No 214; 920-927, 98-119, 114-135, 130-151, 146-167 and 162-182 of Seq ID 
No 217; 36-59 of Seq ID No 219; 194-216 and 381-404 of Seq ID No 220; 236-251 and 255-279 of Seq ID 
No 221; 80-100 and 141-164 of Seq ID No 222; 128-154 of Seq ID No 223; 82-100, 95-116 and 111-134 of 
Seq ID No 224; 55-76, 71-92 and 87-110 of Seq ID No 227; 91-106 of Seq ID No 229; 74-96 of Seq ID No 
230; 140-157 of Seq ID No 231; 4-13 of Seq ID No 233; 41-65 and 499-523 of Seq ID No 236; 122-146, 191- 
215, 288-313, 445-469 and 511-535 of Seq ID No 239; 347-368 of Seq ID No 241; 46-61 of Seq ID No 242; 
15-37, 32-57, 101-121, 115-135, 138-158, 152-172, 220-242 and 236-258 of Seq ID No 243, and fragments 
comprising at least 6, preferably more than 8, especially more than 10 aa and preferably not more than 70, 
50, 40, 20, 15, 11 aa of said sequences. All these fragments individually and each independently form a 
preferred selected aspect of the present invention. 
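
The residue ranges listed above can be read as 1-based, inclusive positions within the respective amino acid sequence. The following is a minimal sketch, under that assumption, of how such a range could be turned into a fragment and how sub-fragments within the stated length limits could be enumerated. The function names and the toy sequence are hypothetical and are not taken from the sequence listing.

```python
def parse_range(text):
    """Parse a range such as '4-9' into 1-based inclusive (start, end)."""
    start_text, end_text = text.split("-")
    start, end = int(start_text), int(end_text)
    if start < 1 or end < start:
        raise ValueError(f"invalid range: {text!r}")
    return start, end


def extract_fragment(sequence, text):
    """Return the residues of `sequence` covered by the range `text`."""
    start, end = parse_range(text)
    if end > len(sequence):
        raise ValueError(f"range {text!r} exceeds sequence length {len(sequence)}")
    # Convert 1-based inclusive positions to a Python slice.
    return sequence[start - 1:end]


def sub_fragments(fragment, min_len=6, max_len=11):
    """List every contiguous sub-fragment with min_len <= length <= max_len."""
    result = []
    for length in range(min_len, min(max_len, len(fragment)) + 1):
        for offset in range(len(fragment) - length + 1):
            result.append(fragment[offset:offset + length])
    return result


# Hypothetical toy sequence, used only to illustrate the indexing.
toy_sequence = "MKTAYIAKQRQISFVKSHFSRQ"
fragment = extract_fragment(toy_sequence, "4-9")
```

The default limits of 6 and 11 residues are only one of the combinations named in the text; other lower and upper limits can be passed as arguments.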

All linear hyperimmune serum reactive fragments of a particular antigen may be identified by analysing 
the entire sequence of the protein antigen by a set of peptides overlapping by 1 amino acid with a length 
of at least 10 amino acids. Subsequently, non-linear epitopes can be identified by analysis of the protein 
antigen with hyperimmune sera using the expressed full-length protein or domain polypeptides thereof. 
Assuming that a distinct domain of a protein is sufficient to form the 3D structure independent from the 
native protein, the analysis of the respective recombinant or synthetically produced domain polypeptide 
with hyperimmune serum would allow the identification of conformational epitopes within the 
individual domains of multi-domain proteins. For those antigens where a domain possesses linear as well 
as conformational epitopes, competition experiments with peptides corresponding to the linear epitopes 
may be used to confirm the presence of conformational epitopes. 
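
The peptide scan described above can be sketched as follows. This is an illustration only: the function name, the toy sequence and the handling of the C-terminal remainder are assumptions, and the overlap is exposed as a parameter because the text can be read either as adjacent peptides sharing one residue or as windows shifted by one residue.

```python
def peptide_scan(sequence, length=10, overlap=1):
    """Return (start, end, peptide) tuples covering `sequence`.

    Positions are 1-based and inclusive. Consecutive peptides share
    `overlap` residues. A final peptide anchored at the C-terminus is
    added when the regular steps do not reach the end of the sequence.
    """
    if not 0 <= overlap < length:
        raise ValueError("overlap must satisfy 0 <= overlap < length")
    if len(sequence) < length:
        raise ValueError("sequence is shorter than the peptide length")
    step = length - overlap
    last_start = len(sequence) - length
    starts = list(range(0, last_start + 1, step))
    if starts[-1] != last_start:
        starts.append(last_start)  # cover the C-terminal residues
    return [(s + 1, s + length, sequence[s:s + length]) for s in starts]


# Hypothetical toy sequence of 22 residues.
toy_sequence = "MKTAYIAKQRQISFVKSHFSRQ"
peptides = peptide_scan(toy_sequence, length=10, overlap=1)
shifted_by_one = peptide_scan(toy_sequence, length=10, overlap=9)
```

With an overlap of 9 residues the windows advance by a single position, which gives the densest possible scan for a given peptide length.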

It will be appreciated that the invention also relates to, among others, nucleic acid molecules encoding the 
aforementioned fragments, nucleic acid molecules that hybridise to nucleic acid molecules encoding the 
fragments, particularly those that hybridise under stringent conditions, and nucleic acid molecules, such 
as PCR primers, for amplifying nucleic acid molecules that encode the fragments. In these regards, 
preferred nucleic acid molecules are those that correspond to the preferred fragments, as discussed 
above. 

The present invention also relates to vectors, which comprise a nucleic acid molecule or nucleic acid 
molecules of the present invention, host cells which are genetically engineered with vectors of the 
invention and the production of hyperimmune serum reactive antigens and fragments thereof by 
recombinant techniques. 

A great variety of expression vectors can be used to express a hyperimmune serum reactive antigen or 
fragment thereof according to the present invention. Generally, any vector suitable to maintain, 
propagate or express nucleic acids to express a polypeptide in a host may be used for expression in this 
regard. In accordance with this aspect of the invention the vector may be, for example, a plasmid vector, 
a single or double-stranded phage vector, a single or double-stranded RNA or DNA viral vector. Starting 
plasmids disclosed herein are either commercially available, publicly available, or can be constructed 
from available plasmids by routine application of well-known, published procedures. Preferred among 
vectors, in certain respects, are those for expression of nucleic acid molecules and hyperimmune serum 
reactive antigens or fragments thereof of the present invention. Nucleic acid constructs in host cells can 
be used in a conventional manner to produce the gene product encoded by the recombinant sequence. 
Alternatively, the hyperimmune serum reactive antigens and fragments thereof of the invention can be 
synthetically produced by conventional peptide synthesizers. Mature proteins can be expressed in 
mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived from the DNA 
construct of the present invention. 

Host cells can be genetically engineered to incorporate nucleic acid molecules and express nucleic acid 
molecules of the present invention. Representative examples of appropriate hosts include bacterial cells, 
such as streptococci, staphylococci, E. coli, Streptomyces and Bacillus subtilis cells; fungal cells, such as 
yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells 
such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and Bowes melanoma cells; and plant cells. 

The invention also provides a process for producing a S. pneumoniae hyperimmune serum reactive 
antigen and a fragment thereof comprising expressing from the host cell a hyperimmune serum reactive 
antigen or fragment thereof encoded by the nucleic acid molecules provided by the present invention. 
The invention further provides a process for producing a cell, which expresses a S. pneumoniae 
hyperimmune serum reactive antigen or a fragment thereof comprising transforming or transfecting a 
suitable host cell with the vector according to the present invention such that the transformed or 
transfected cell expresses the polypeptide encoded by the nucleic acid contained in the vector. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may include not 
only secretion signals but also additional heterologous functional regions. Thus, for instance, a region of 
additional amino acids, particularly charged amino acids, may be added to the N- or C-terminus of the 
polypeptide to improve stability and persistence in the host cell, during purification or during 
subsequent handling and storage. Also, regions may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final preparation of the polypeptide. The addition of 
peptide moieties to polypeptides to engender secretion or excretion, to improve stability or to facilitate 
purification, among others, are familiar and routine techniques in the art. A preferred fusion protein 
comprises a heterologous region from immunoglobulin that is useful to solubilize or purify polypeptides. 
For example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising 
various portions of constant region of immunoglobulin molecules together with another protein or part 
thereof. In drug discovery, for example, proteins have been fused with antibody Fc portions for the 
purpose of high-throughput screening assays to identify antagonists. See for example, {Bennett, D. et al., 
1995} and {Johanson, K. et al., 1995}. 

The S. pneumoniae hyperimmune serum reactive antigen or a fragment thereof can be recovered and 
purified from recombinant cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, hydroxylapatite chromatography and lectin 
chromatography. 

The hyperimmune serum reactive antigens and fragments thereof according to the present invention can 
be produced by chemical synthesis as well as by biotechnological means. The latter comprise the 
transfection or transformation of a host cell with a vector containing a nucleic acid according to the 
present invention and the cultivation of the transfected or transformed host cell under conditions, which 
are known to the ones skilled in the art. The production method may also comprise a purification step in 
order to purify or isolate the polypeptide to be manufactured. In a preferred embodiment the vector is a 
vector according to the present invention. 



The hyperimmune serum reactive antigens and fragments thereof according to the present invention may 
be used for the detection of the organism or organisms in a sample containing these organisms or 
polypeptides derived thereof. Preferably such detection is for diagnosis, more preferably for the diagnosis 
of a disease, most preferably for the diagnosis of a disease related or linked to the presence or abundance 
of Gram-positive bacteria, especially bacteria selected from the group comprising streptococci, 
staphylococci and lactococci. More preferably, the microorganisms are selected from the group 
comprising Streptococcus agalactiae, Streptococcus pyogenes and Streptococcus mutans, especially the 
microorganism is Streptococcus pyogenes. 

The present invention also relates to diagnostic assays such as quantitative and diagnostic assays for 
detecting levels of the hyperimmune serum reactive antigens and fragments thereof of the present 
invention in cells and tissues, including determination of normal and abnormal levels. Thus, for instance, 
a diagnostic assay in accordance with the invention for detecting over-expression of the polypeptide 
compared to normal control tissue samples may be used to detect the presence of an infection, for 
example, and to identify the infecting organism. Assay techniques that can be used to determine levels of 
a polypeptide in a sample derived from a host are well known to those of skill in the art. Such assay 
methods include radioimmunoassays, competitive-binding assays, Western blot analysis and ELISA 
assays. Among these, ELISAs frequently are preferred. An ELISA assay initially comprises preparing an 
antibody specific to the polypeptide, preferably a monoclonal antibody. In addition, a reporter antibody 
generally is prepared which binds to the monoclonal antibody. The reporter antibody is attached to a 
detectable reagent such as radioactive, fluorescent or enzymatic reagent, such as horseradish peroxidase 
enzyme. 

The hyperimmune serum reactive antigens and fragments thereof according to the present invention may 
also be used for the purpose of or in connection with an array. More particularly, at least one of the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention may be 
immobilized on a support. Said support typically comprises a variety of hyperimmune serum reactive 
antigens and fragments thereof whereby the variety may be created by using one or several of the 
hyperimmune serum reactive antigens and fragments thereof according to the present invention and/or 
hyperimmune serum reactive antigens and fragments thereof being different. The characterizing feature 
of such array as well as of any array in general is the fact that at a distinct or predefined region or 
position on said support or a surface thereof, a distinct polypeptide is immobilized. Because of this any 
activity at a distinct position or region of an array can be correlated with a specific polypeptide. The 
number of different hyperimmune serum reactive antigens and fragments thereof immobilized on a 
support may range from as little as 10 to several 1000 different hyperimmune serum reactive antigens 
and fragments thereof. The density of hyperimmune serum reactive antigens and fragments thereof per 
cm² is in a preferred embodiment as little as 10 peptides/polypeptides per cm² to at least 400 different 
peptides/polypeptides per cm² and more particularly at least 1000 different hyperimmune serum reactive 
antigens and fragments thereof per cm². 
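
The correlation between array position and immobilized polypeptide described above can be illustrated with a small sketch. It is not a description of any particular array platform: the grid layout, the identifiers P1 to P5, the signal values and the threshold are all hypothetical.

```python
def build_layout(polypeptide_ids, columns):
    """Assign each polypeptide to a distinct (row, column) position."""
    if columns < 1:
        raise ValueError("columns must be at least 1")
    return {divmod(index, columns): pid
            for index, pid in enumerate(polypeptide_ids)}


def density_per_cm2(spot_count, width_cm, height_cm):
    """Number of immobilized polypeptides per square centimetre."""
    return spot_count / (width_cm * height_cm)


def reactive_polypeptides(layout, signals, threshold):
    """Map positions whose signal reaches `threshold` back to polypeptides."""
    return sorted(layout[position]
                  for position, value in signals.items()
                  if value >= threshold)


# Hypothetical identifiers and readings, for illustration only.
layout = build_layout(["P1", "P2", "P3", "P4", "P5"], columns=2)
signals = {(0, 1): 0.9, (1, 0): 0.1, (2, 0): 0.7}
hits = reactive_polypeptides(layout, signals, threshold=0.5)
```

Because every position holds exactly one polypeptide, a signal measured at a position identifies the polypeptide without further deconvolution.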

The manufacture of such arrays is known to the one skilled in the art and, for example, described in US 
patent 5,744,309. The array preferably comprises a planar, porous or non-porous solid support having at 
least a first surface. The hyperimmune serum reactive antigens and fragments thereof as disclosed herein, 
are immobilized on said surface. Preferred support materials are, among others, glass or cellulose. It is 
also within the present invention that the array is used for any of the diagnostic applications described 
herein. Apart from the hyperimmune serum reactive antigens and fragments thereof according to the 
present invention also the nucleic acid molecules according to the present invention may be used for the 
generation of an array as described above. This applies as well to an array made of antibodies, preferably 
monoclonal antibodies as, among others, described herein. 

In a further aspect the present invention relates to an antibody directed to any of the hyperimmune 
serum reactive antigens and fragments thereof, derivatives or fragments thereof according to the present 
invention. The present invention includes, for example, monoclonal and polyclonal antibodies, chimeric, 
single chain, and humanized antibodies, as well as Fab fragments, or the product of a Fab expression 
library. It is within the present invention that the antibody may be chimeric, i.e. that different parts 
thereof stem from different species or at least the respective sequences are taken from different species. 

Antibodies generated against the hyperimmune serum reactive antigens and fragments thereof 
corresponding to a sequence of the present invention can be obtained by direct injection of the 
hyperimmune serum reactive antigens and fragments thereof into an animal or by administering the 
hyperimmune serum reactive antigens and fragments thereof to an animal, preferably a non-human. The 
antibody so obtained will then bind the hyperimmune serum reactive antigens and fragments thereof 
itself. In this manner, even a sequence encoding only a fragment of a hyperimmune serum reactive 
antigen and fragments thereof can be used to generate antibodies binding the whole native hyperimmune 
serum reactive antigen and fragments thereof. Such antibodies can then be used to isolate the 
hyperimmune serum reactive antigens and fragments thereof from tissue expressing those hyperimmune 
serum reactive antigens and fragments thereof. 

For preparation of monoclonal antibodies, any technique known in the art which provides antibodies 
produced by continuous cell line cultures can be used (as described originally in {Kohler, G. et al., 1975}). 

Techniques described for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be 
adapted to produce single chain antibodies to immunogenic hyperimmune serum reactive antigens and 
fragments thereof according to this invention. Also, transgenic mice, or other organisms such as other 
mammals, may be used to express humanized antibodies to immunogenic hyperimmune serum reactive 
antigens and fragments thereof according to this invention. 

Alternatively, phage display technology or ribosomal display could be utilized to select antibody genes 
with binding activities towards the hyperimmune serum reactive antigens and fragments thereof either 
from repertoires of PCR amplified v-genes of lymphocytes from humans screened for possessing 
respective target antigens or from naïve libraries {McCafferty, J. et al., 1990}; {Marks, J. et al., 1992}. The 
affinity of these antibodies can also be improved by chain shuffling {Clackson, T. et al., 1991}. 

If two antigen binding domains are present, each domain may be directed against a different epitope - 
termed 'bispecific' antibodies. 

The above-described antibodies may be employed to isolate or to identify clones expressing the 
hyperimmune serum reactive antigens and fragments thereof or purify the hyperimmune serum reactive 
antigens and fragments thereof of the present invention by attachment of the antibody to a solid support 
for isolation and/or purification by affinity chromatography. 

Thus, among others, antibodies against the hyperimmune serum reactive antigens and fragments thereof 
of the present invention may be employed to inhibit and/or treat infections, particularly bacterial 
infections and especially infections arising from S. pneumoniae. 

Hyperimmune serum reactive antigens and fragments thereof include antigenically, epitopically or 
immunologically equivalent derivatives, which form a particular aspect of this invention. The term 
"antigenically equivalent derivative" as used herein encompasses a hyperimmune serum reactive antigen 
and fragments thereof or its equivalent which will be specifically recognized by certain antibodies which, 
when raised to the protein or hyperimmune serum reactive antigen and fragments thereof according to 
the present invention, interfere with the interaction between pathogen and mammalian host. The term 
"immunologically equivalent derivative" as used herein encompasses a peptide or its equivalent which 
when used in a suitable formulation to raise antibodies in a vertebrate, the antibodies act to interfere with 
the interaction between pathogen and mammalian host. 

The hyperimmune serum reactive antigens and fragments thereof, such as an antigenically or 
immunologically equivalent derivative or a fusion protein thereof can be used as an antigen to immunize 
a mouse or other animal such as a rat or chicken. The fusion protein may provide stability to the 
hyperimmune serum reactive antigens and fragments thereof. The antigen may be associated, for 
example by conjugation, with an immunogenic carrier protein, for example bovine serum albumin (BSA) 
or keyhole limpet haemocyanin (KLH). Alternatively, an antigenic peptide comprising multiple copies of 
the protein or hyperimmune serum reactive antigen and fragments thereof, or an antigenically or 
immunologically equivalent hyperimmune serum reactive antigen and fragments thereof, may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

Preferably the antibody or derivative thereof is modified to make it less immunogenic in the individual. 
For example, if the individual is human the antibody may most preferably be "humanized", wherein the 
complementarity determining region(s) of the hybridoma-derived antibody has been transplanted into a 
human monoclonal antibody, for example as described in {Jones, P. et al., 1986} or {Tempest, P. et al., 
1991}. 

The use of a polynucleotide of the invention in genetic immunization will preferably employ a suitable 
delivery method such as direct injection of plasmid DNA into muscle, delivery of DNA complexed with 
specific protein carriers, coprecipitation of DNA with calcium phosphate, encapsulation of DNA in 
various forms of liposomes, particle bombardment {Tang, D. et al., 1992}; {Eisenbraun, M. et al., 1993} and 
in vivo infection using cloned retroviral vectors {Seeger, C. et al., 1984}. 

In a further aspect the present invention relates to a peptide binding to any of the hyperimmune serum 
reactive antigens and fragments thereof according to the present invention, and a method for the 
manufacture of such peptides whereby the method is characterized by the use of the hyperimmune 
serum reactive antigens and fragments thereof according to the present invention and the basic steps are 
known to the one skilled in the art. 

Such peptides may be generated by using methods according to the state of the art such as phage display 
or ribosome display. In case of phage display, basically a library of peptides is generated, in form of 
phages, and this kind of library is contacted with the target molecule, in the present case a hyperimmune 
serum reactive antigen and fragments thereof according to the present invention. Those peptides binding 
to the target molecule are subsequently removed, preferably as a complex with the target molecule, from 
the respective reaction. It is known to the one skilled in the art that the binding characteristics, at least to a 
certain extent, depend on the particularly realized experimental set-up such as the salt concentration and 
the like. After separating those peptides binding to the target molecule with a higher affinity or a bigger 
force, from the non-binding members of the library, and optionally also after removal of the target 
molecule from the complex of target molecule and peptide, the respective peptide(s) may subsequently 
be characterised. Prior to the characterisation optionally an amplification step is realized such as, e.g. by 
propagating the peptide encoding phages. The characterisation preferably comprises the sequencing of 
the target binding peptides. Basically, the peptides are not limited in their lengths, however, 
peptides having a length from about 8 to 20 amino acids are preferably obtained in the respective 
methods. The size of the libraries may be about 10^2 to 10^18, preferably 10^8 to 10^15 different peptides, 
however, is not limited thereto. 

A particular form of target binding hyperimmune serum reactive antigens and fragments thereof are the 
so-called "anticalines" which are, among others, described in German patent application DE 197 42 706. 

In a further aspect the present invention relates to functional nucleic acids interacting with any of the
hyperimmune serum reactive antigens and fragments thereof according to the present invention, and a
method for the manufacture of such functional nucleic acids whereby the method is characterized by the
use of the hyperimmune serum reactive antigens and fragments thereof according to the present
invention and the basic steps are known to the one skilled in the art. The functional nucleic acids are
preferably aptamers and spiegelmers.

Aptamers are D-nucleic acids, which are either single stranded or double stranded and which specifically
interact with a target molecule. The manufacture or selection of aptamers is, e.g., described in European
patent EP 0 533 838. Basically the following steps are realized. First, a mixture of nucleic acids, i.e.
potential aptamers, is provided whereby each nucleic acid typically comprises a segment of several,
preferably at least eight subsequent randomised nucleotides. This mixture is subsequently contacted with
the target molecule whereby the nucleic acid(s) bind to the target molecule, such as based on an increased
affinity towards the target or with a bigger force thereto, compared to the candidate mixture. The binding
nucleic acid(s) are/is subsequently separated from the remainder of the mixture. Optionally, the thus
obtained nucleic acid(s) is amplified using, e.g., polymerase chain reaction. These steps may be repeated
several times giving at the end a mixture having an increased ratio of nucleic acids specifically binding to
the target from which the final binding nucleic acid is then optionally selected. These specifically binding
nucleic acid(s) are referred to as aptamers. It is obvious that at any stage of the method for the generation
or identification of the aptamers samples of the mixture of individual nucleic acids may be taken to
determine the sequence thereof using standard techniques. It is within the present invention that the
aptamers may be stabilized such as, e.g., by introducing defined chemical groups which are known to
the one skilled in the art of generating aptamers. Such modification may for example reside in the
introduction of an amino group at the 2'-position of the sugar moiety of the nucleotides. Aptamers are
currently used as therapeutical agents. However, it is also within the present invention that the thus 
selected or generated aptamers may be used for target validation and/or as lead substance for the 
development of medicaments, preferably of medicaments based on small molecules. This is actually done 
by a competition assay whereby the specific interaction between the target molecule and the aptamer is 
inhibited by a candidate drug whereby upon replacement of the aptamer from the complex of target and 
aptamer it may be assumed that the respective drug candidate allows a specific inhibition of the 
interaction between target and aptamer, and if the interaction is specific, said candidate drug will, at least
in principle, be suitable to block the target and thus decrease its biological availability or activity in a
respective system comprising such target. The thus obtained small molecule may then be subject to
further derivatisation and modification to optimise its physical, chemical, biological and/or medical
characteristics such as toxicity, specificity, biodegradability and bioavailability.
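
The repeated rounds of binding, separation and amplification described above can be illustrated with a toy calculation. The following Python sketch is not part of the disclosed method: the function name `enrich` and the survival probabilities are hypothetical values chosen only to show how the share of specifically binding nucleic acids in the mixture may rise from round to round.

```python
def enrich(binder_fraction, keep_binder, keep_background, rounds):
    """Return the binder fraction after each selection round.

    binder_fraction: starting share of target-binding sequences (0..1)
    keep_binder:     assumed probability that a binder survives separation
    keep_background: assumed probability that a non-binder survives
    Amplification (e.g. by PCR) is modelled as restoring the pool size
    without changing the composition of the pool.
    """
    history = []
    fraction = binder_fraction
    for _ in range(rounds):
        binders = fraction * keep_binder
        background = (1.0 - fraction) * keep_background
        fraction = binders / (binders + background)
        history.append(fraction)
    return history


if __name__ == "__main__":
    # Hypothetical starting pool: one binder per million sequences.
    for round_no, share in enumerate(enrich(1e-6, 0.5, 0.01, 6), start=1):
        print(f"round {round_no}: binder fraction = {share:.6f}")
```

In this simplified model the odds of drawing a binder are multiplied by `keep_binder / keep_background` in every round, which is why a few repetitions can suffice to enrich a rare binding sequence.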

The generation or manufacture of spiegelmers is based on a similar principle. The manufacture of
spiegelmers is described in international patent application WO 98/08856. Spiegelmers are L-nucleic
acids, which means that they are composed of L-nucleotides rather than D-nucleotides as aptamers are.
Spiegelmers are characterized by the fact that they have a very high stability in biological systems and,
comparable to aptamers, specifically interact with the target molecule against which they are directed. In
the process of generating spiegelmers, a heterogeneous population of D-nucleic acids is created and this
population is contacted with the optical antipode of the target molecule, in the present case for example
with the D-enantiomer of the naturally occurring L-enantiomer of the hyperimmune serum reactive
antigens and fragments thereof according to the present invention. Subsequently, those D-nucleic acids
which do not interact with the optical antipode of the target molecule are separated off, whereas those
D-nucleic acids interacting with the optical antipode of the target molecule are isolated, optionally identified
and/or sequenced, and subsequently the corresponding L-nucleic acids are synthesized based on the
nucleic acid sequence information obtained from the D-nucleic acids. These L-nucleic acids, which are
identical in terms of sequence with the aforementioned D-nucleic acids interacting with the optical
antipode of the target molecule, will specifically interact with the naturally occurring target molecule
rather than with the optical antipode thereof. Similar to the method for the generation of aptamers, it is
also possible to repeat the various steps several times and thus to enrich those nucleic acids specifically
interacting with the optical antipode of the target molecule.

In a further aspect the present invention relates to functional nucleic acids interacting with any of the
nucleic acid molecules according to the present invention, and a method for the manufacture of such
functional nucleic acids whereby the method is characterized by the use of the nucleic acid molecules and
their respective sequences according to the present invention and the basic steps are known to the one
skilled in the art. The functional nucleic acids are preferably ribozymes, antisense oligonucleotides and
siRNA.

Ribozymes are catalytically active nucleic acids which preferably consist of RNA which basically
comprises two moieties. The first moiety shows a catalytic activity whereas the second moiety is
responsible for the specific interaction with the target nucleic acid, in the present case the nucleic acid
coding for the hyperimmune serum reactive antigens and fragments thereof according to the present
invention. Upon interaction between the target nucleic acid and the second moiety of the ribozyme,
typically by hybridisation and Watson-Crick base pairing of essentially complementary stretches of bases
on the two hybridising strands, the catalytically active moiety may become active, which means that it
catalyses, either intramolecularly or intermolecularly, the cleavage of the target nucleic acid in case the
catalytic activity of the ribozyme is a phosphodiesterase activity. Subsequently, there may be a further
degradation of the target nucleic acid, which in the end results in the degradation of the target nucleic acid
as well as the protein derived from the said target nucleic acid. Ribozymes, their use and design principles
are known to the one skilled in the art and are, for example, described in {Doherty, E. et al., 2001} and
{Lewin, A. et al., 2001}.

The activity and design of antisense oligonucleotides for the manufacture of a medicament and as a
diagnostic agent, respectively, is based on a similar mode of action. Basically, antisense oligonucleotides
hybridise, based on base complementarity, with a target RNA, preferably with an mRNA, thereby
activating RNase H. RNase H is activated by both phosphodiester and phosphorothioate-coupled DNA.
Phosphodiester-coupled DNA, however, is rapidly degraded by cellular nucleases, with the exception of
phosphorothioate-coupled DNA. These resistant, non-naturally occurring DNA derivatives do not inhibit
RNase H upon hybridisation with RNA. In other words, antisense polynucleotides are only effective as
DNA-RNA hybrid complexes. Examples for this kind of antisense oligonucleotides are described,
among others, in US patents US 5,849,902 and US 5,989,912. In other words, based on the nucleic acid
sequence of the target molecule, which in the present case are the nucleic acid molecules for the
hyperimmune serum reactive antigens and fragments thereof according to the present invention, either
from the target protein from which a respective nucleic acid sequence may in principle be deduced, or by
knowing the nucleic acid sequence as such, particularly the mRNA, suitable antisense oligonucleotides
may be designed based on the principle of base complementarity.
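
The principle of base complementarity mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `antisense_dna` and the example sequence are hypothetical and are not taken from the sequences of the present invention. The sketch derives the DNA oligonucleotide, written 5'->3', that would pair with a given mRNA segment, likewise written 5'->3'.

```python
# Watson-Crick partners of the RNA bases, expressed as DNA bases.
DNA_PARTNER = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_dna(mrna_segment):
    """Return the complementary DNA oligonucleotide (5'->3') for an mRNA segment (5'->3')."""
    segment = mrna_segment.upper()
    unknown = set(segment) - set(DNA_PARTNER)
    if unknown:
        raise ValueError(f"not an RNA sequence: {sorted(unknown)}")
    # The two strands pair in antiparallel orientation, so the target
    # is read backwards while each base is replaced by its partner.
    return "".join(DNA_PARTNER[base] for base in reversed(segment))


if __name__ == "__main__":
    print(antisense_dna("AUGGCU"))  # hypothetical six-base target segment
```

A practical design would additionally consider length, target accessibility and the chemical modifications discussed in the text; none of these is modelled here.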

Particularly preferred are antisense oligonucleotides which have a short stretch of phosphorothioate
DNA (3 to 9 bases). A minimum of 3 DNA bases is required for activation of bacterial RNase H and a
minimum of 5 bases is required for mammalian RNase H activation. In these chimeric oligonucleotides
there is a central region that forms a substrate for RNase H that is flanked by hybridising "arms"
comprised of modified nucleotides that do not form substrates for RNase H. The hybridising arms of the
chimeric oligonucleotides may be modified such as by 2'-O-methyl or 2'-fluoro. Alternative approaches
used methylphosphonate or phosphoramidate linkages in said arms. Further embodiments of the
antisense oligonucleotide useful in the practice of the present invention are P-methoxyoligonucleotides,
partial P-methoxyoligodeoxyribonucleotides or P-methoxyoligonucleotides.

Of particular relevance and usefulness for the present invention are those antisense oligonucleotides as
more particularly described in the two above-mentioned US patents. These oligonucleotides contain no
naturally occurring 5'->3'-linked nucleotides. Rather, the oligonucleotides have two types of nucleotides:
2'-deoxyphosphorothioate, which activate RNase H, and 2'-modified nucleotides, which do not. The
linkages between the 2'-modified nucleotides can be phosphodiesters, phosphorothioate or
P-ethoxyphosphodiester. Activation of RNase H is accomplished by a contiguous RNase H-activating
region, which contains between 3 and 5 2'-deoxyphosphorothioate nucleotides to activate bacterial RNase
H and between 5 and 10 2'-deoxyphosphorothioate nucleotides to activate eucaryotic and, particularly,
mammalian RNase H. Protection from degradation is accomplished by making the 5' and 3' terminal
bases highly nuclease resistant and, optionally, by placing a 3' terminal blocking group.
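
The requirement for a contiguous RNase H-activating region can be checked mechanically. The following Python sketch rests on assumptions: the layout notation ('d' for a 2'-deoxyphosphorothioate nucleotide, 'm' for a 2'-modified nucleotide) and both function names are hypothetical, and only the minimum lengths of 3 (bacterial) and 5 (mammalian) stated in the text are applied.

```python
import re

# Minimum contiguous RNase H-activating stretch, as stated in the text.
MIN_ACTIVATING_RUN = {"bacterial": 3, "mammalian": 5}


def longest_activating_run(layout):
    """Length of the longest run of 'd' in a layout string.

    Hypothetical notation: 'd' = 2'-deoxyphosphorothioate nucleotide,
    'm' = 2'-modified nucleotide.
    """
    if set(layout) - {"d", "m"}:
        raise ValueError("layout may only contain 'd' and 'm'")
    return max((len(run) for run in re.findall(r"d+", layout)), default=0)


def can_activate(layout, rnase_h):
    """True if the layout holds a long enough contiguous 'd' region."""
    return longest_activating_run(layout) >= MIN_ACTIVATING_RUN[rnase_h]


if __name__ == "__main__":
    layout = "mmmmdddmmmm"  # hypothetical chimeric oligonucleotide
    print(can_activate(layout, "bacterial"), can_activate(layout, "mammalian"))
```

The upper bounds given in the text (5 and 10 nucleotides, respectively) are deliberately not enforced in this sketch.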

More particularly, the antisense oligonucleotide comprises a 5' terminus and a 3' terminus; and from
position 11 to 59 5'->3'-linked nucleotides independently selected from the group consisting of
2'-modified phosphodiester nucleotides and 2'-modified P-alkyloxyphosphotriester nucleotides; and
wherein the 5'-terminal nucleoside is attached to an RNase H-activating region of between three and ten
contiguous phosphorothioate-linked deoxyribonucleotides, and wherein the 3'-terminus of said
oligonucleotide is selected from the group consisting of an inverted deoxyribonucleotide, a contiguous
stretch of one to three phosphorothioate 2'-modified ribonucleotides, a biotin group and a
P-alkyloxyphosphotriester nucleotide.

Also an antisense oligonucleotide may be used wherein not the 5' terminal nucleoside is attached to an 
RNase H-activating region but the 3' terminal nucleoside as specified above. Also, the 5' terminus is 
selected from the particular group rather than the 3' terminus of said oligonucleotide.

The nucleic acids as well as the hyperimmune serum reactive antigens and fragments thereof according 
to the present invention may be used as or for the manufacture of pharmaceutical compositions, 
especially vaccines. Preferably such pharmaceutical composition, preferably vaccine is for the prevention 
or treatment of diseases caused by, related to or associated with S. pneumoniae. In so far another aspect of 
the invention relates to a method for inducing an immunological response in an individual, particularly a
mammal, which comprises inoculating the individual with the hyperimmune serum reactive antigens
and fragments thereof of the invention, or a fragment or variant thereof, adequate to produce antibodies
to protect said individual from infection, particularly streptococcal infection and most particularly S.
pneumoniae infections.

Yet another aspect of the invention relates to a method of inducing an immunological response in an 
individual which comprises, through gene therapy or otherwise, delivering a nucleic acid functionally 
encoding hyperimmune serum reactive antigens and fragments thereof, or a fragment or a variant 
thereof, for expressing the hyperimmune serum reactive antigens and fragments thereof, or a fragment or
a variant thereof in vivo in order to induce an immunological response to produce antibodies or a cell
mediated T cell response, either cytokine-producing T cells or cytotoxic T cells, to protect said individual
from disease, whether that disease is already established within the individual or not. One way of
administering the gene is by accelerating it into the desired cells as a coating on particles or otherwise.

A further aspect of the invention relates to an immunological composition which, when introduced into a
host capable of having induced within it an immunological response, induces an immunological response
in such host, wherein the composition comprises recombinant DNA which codes for and expresses an
antigen of the hyperimmune serum reactive antigens and fragments thereof of the present invention. The
immunological response may be used therapeutically or prophylactically and may take the form of
antibody immunity or cellular immunity such as that arising from CTL or CD4+ T cells.

The hyperimmune serum reactive antigens and fragments thereof of the invention or a fragment thereof
may be fused with a co-protein which may not by itself produce antibodies, but is capable of stabilizing
the first protein and producing a fused protein which will have immunogenic and protective properties.
This fused recombinant protein preferably further comprises an antigenic co-protein, such as
Glutathione-S-transferase (GST) or beta-galactosidase, relatively large co-proteins which solubilise the
protein and facilitate production and purification thereof. Moreover, the co-protein may act as an
adjuvant in the sense of providing a generalized stimulation of the immune system. The co-protein may
be attached to either the amino or carboxy terminus of the first protein.

Also provided by this invention are methods using the described nucleic acid molecule or particular
fragments thereof in such genetic immunization experiments in animal models of infection with S.
pneumoniae. Such fragments will be particularly useful for identifying protein epitopes able to provoke a
prophylactic or therapeutic immune response. This approach can allow for the subsequent preparation of
monoclonal antibodies of particular value from the requisite organ of the animal successfully resisting or
clearing infection for the development of prophylactic agents or therapeutic treatments of S. pneumoniae
infection in mammals, particularly humans.

The hyperimmune serum reactive antigens and fragments thereof may be used as an antigen for 
vaccination of a host to produce specific antibodies which protect against invasion of bacteria, for 
example by blocking adherence of bacteria to damaged tissue. Examples of tissue damage include 
wounds in skin or connective tissue and mucosal tissues caused e.g. by viral infection (esp. respiratory,
such as the flu), mechanical, chemical or thermal damage or by implantation of indwelling devices, or
wounds in the mucous membranes, such as the mouth, mammary glands, urethra or vagina.

The present invention also includes a vaccine formulation which comprises the immunogenic
recombinant protein together with a suitable carrier. Since the protein may be broken down in the
stomach, it is preferably administered parenterally, including, for example, administration that is
subcutaneous, intramuscular, intravenous, intradermal, intranasal or transdermal. Formulations suitable
for parenteral administration include aqueous and non-aqueous sterile injection solutions which may
contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the
bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous sterile suspensions
which may include suspending agents or thickening agents. The formulations may be presented in
unit-dose or multi-dose containers, for example, sealed ampoules and vials, and may be stored in a
freeze-dried condition requiring only the addition of the sterile liquid carrier immediately prior to use. The
vaccine formulation may also include adjuvant systems for enhancing the immunogenicity of the
formulation, such as oil-in-water systems and other systems known in the art. The dosage will depend on
the specific activity of the vaccine and can be readily determined by routine experimentation.

According to another aspect the present invention relates to a pharmaceutical composition comprising 
such a hyperimmune serum-reactive antigen or a fragment thereof as provided in the present invention
for S. pneumoniae. Such a pharmaceutical composition may comprise one, preferably at least two or more 
hyperimmune serum reactive antigens or fragments thereof against S. pneumoniae. Optionally, such S. 
pneumoniae hyperimmune serum reactive antigens or fragments thereof may also be combined with 
antigens against other pathogens in a combination pharmaceutical composition. Preferably, said 
pharmaceutical composition is a vaccine for preventing or treating an infection caused by S. pneumoniae 
and/or other pathogens against which the antigens have been included in the vaccine.

According to a further aspect, the present invention relates to a pharmaceutical composition comprising a
nucleic acid molecule encoding a hyperimmune serum-reactive antigen or a fragment thereof as
identified above for S. pneumoniae. Such a pharmaceutical composition may comprise one or more nucleic
acid molecules encoding hyperimmune serum reactive antigens or fragments thereof against S.
pneumoniae. Optionally, such S. pneumoniae nucleic acid molecules encoding hyperimmune serum
reactive antigens or fragments thereof may also be combined with nucleic acid molecules encoding
antigens against other pathogens in a combination pharmaceutical composition. Preferably, said
pharmaceutical composition is a vaccine for preventing or treating an infection caused by S. pneumoniae
and/or other pathogens against which the antigens have been included in the vaccine.

The pharmaceutical composition may contain any suitable auxiliary substances, such as buffer
substances, stabilisers or further active ingredients, especially ingredients known in connection with
pharmaceutical composition and/or vaccine production.

A preferable carrier or excipient for the hyperimmune serum-reactive antigens, fragments thereof or a
coding nucleic acid molecule thereof according to the present invention is an immunostimulatory
compound for further stimulating the immune response to the given hyperimmune serum-reactive
antigen, fragment thereof or a coding nucleic acid molecule thereof. Preferably the immunostimulatory
compound in the pharmaceutical preparation according to the present invention is selected from the
group of polycationic substances, especially polycationic peptides, immunostimulatory nucleic acid
molecules, preferably immunostimulatory deoxynucleotides, alum, Freund's complete adjuvants,
Freund's incomplete adjuvants, neuroactive compounds, especially human growth hormone, or
combinations thereof.

It is also within the scope of the present invention that the pharmaceutical composition, especially
vaccine, comprises apart from the hyperimmune serum reactive antigens, fragments thereof and/or
coding nucleic acid molecules thereof according to the present invention other compounds which are
biologically or pharmaceutically active. Preferably, the vaccine composition comprises at least one
polycationic peptide. The polycationic compound(s) to be used according to the present invention may be
any polycationic compound which shows the characteristic effects according to WO 97/30721.
Preferred polycationic compounds are selected from basic polypeptides, organic polycations, basic
polyamino acids or mixtures thereof. These polyamino acids should have a chain length of at least 4
amino acid residues (WO 97/30721). Especially preferred are substances like polylysine, polyarginine and
polypeptides containing more than 20 %, especially more than 50 % of basic amino acids in a range of
more than 8, especially more than 20, amino acid residues or mixtures thereof. Other preferred
polycations and their pharmaceutical compositions are described in WO 97/30721 (e.g.
polyethyleneimine) and WO 99/38528. Preferably these polypeptides contain between 20 and 500 amino
acid residues, especially between 30 and 200 residues.
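
The percentage and length preferences stated above can be illustrated by a short calculation. The following Python sketch makes assumptions that are not found in the text: lysine, arginine and histidine (one-letter codes K, R and H) are counted as basic amino acids, the percentage is computed over the whole peptide, and the function names are hypothetical.

```python
# Assumption: Lys (K), Arg (R) and His (H) are counted as basic residues.
BASIC_RESIDUES = frozenset("KRH")


def basic_fraction(peptide):
    """Share of basic residues in a peptide given in one-letter code."""
    peptide = peptide.upper()
    if not peptide:
        raise ValueError("empty peptide")
    return sum(residue in BASIC_RESIDUES for residue in peptide) / len(peptide)


def meets_preference(peptide, min_fraction=0.2, min_length=8):
    """True if the peptide has more than min_length residues and more than
    min_fraction basic residues (defaults: 'more than 8', 'more than 20 %')."""
    return len(peptide) > min_length and basic_fraction(peptide) > min_fraction


if __name__ == "__main__":
    polylysine = "K" * 30  # hypothetical 30-residue polylysine
    print(basic_fraction(polylysine), meets_preference(polylysine, 0.5, 20))
```

Calling `meets_preference(peptide, 0.5, 20)` applies the narrower "especially" thresholds of more than 50 % basic residues and more than 20 residues.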

These polycationic compounds may be produced chemically or recombinantly or may be derived from
natural sources.

Cationic (poly)peptides may also be anti-microbial with properties as reviewed in {Ganz, T., 1999}. These
(poly)peptides may be of prokaryotic or animal or plant origin or may be produced chemically or
recombinantly (WO 02/13857). Peptides may also belong to the class of defensins (WO 02/13857).
Sequences of such peptides can be, for example, found in the Antimicrobial Sequences Database under
the following internet address:

http://www.bbcm.univ.trieste.it/~tossi/pag2.html

Such host defence peptides or defensins are also a preferred form of the polycationic polymer according
to the present invention. Generally, a compound allowing as an end product activation (or
down-regulation) of the adaptive immune system, preferably mediated by APCs (including dendritic cells), is
used as polycationic polymer.

Especially preferred for use as polycationic substances in the present invention are cathelicidin derived
antimicrobial peptides or derivatives thereof (International patent application WO 02/13857, incorporated
herein by reference), especially antimicrobial peptides derived from mammalian cathelicidin, preferably
from human, bovine or mouse.

Polycationic compounds derived from natural sources include HIV-REV or HIV-TAT (derived cationic
peptides, antennapedia peptides, chitosan or other derivatives of chitin) or other peptides derived from
these peptides or proteins by biochemical or recombinant production. Other preferred polycationic
compounds are cathelin or related or derived substances from cathelin. For example, mouse cathelin is a
peptide which has the amino acid sequence NH2-RLAGLLRKGGEKIGEKLKKIGQKIK[...]-COOH.
Related or derived cathelin substances contain the whole or parts of the cathelin sequence with at
least 15-20 amino acid residues. Derivations may include the substitution or modification of the natural
amino acids by amino acids which are not among the 20 standard amino acids. Moreover, further cationic
residues may be introduced into such cathelin molecules. These cathelin molecules are preferred to be
combined with the antigen. These cathelin molecules surprisingly have turned out to be also effective as
an adjuvant for an antigen without the addition of further adjuvants. It is therefore possible to use such
cathelin molecules as efficient adjuvants in vaccine formulations with or without further
immunoactivating substances.

Another preferred polycationic substance to be used according to the present invention is a synthetic 
peptide containing at least 2 KLK-motifs separated by a linker of 3 to 7 hydrophobic amino acids
(International patent application WO 02/32451, incorporated herein by reference). 
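
The structural rule stated above (two KLK motifs separated by a linker of 3 to 7 hydrophobic amino acids) can be expressed as a pattern match. The following Python sketch is an illustration only: the set of residues treated as hydrophobic is an assumption, since the text does not define it, and the function name and the example peptide are hypothetical.

```python
import re

# Assumption: the text does not define which residues are hydrophobic;
# this set (one-letter codes) is chosen for illustration only.
HYDROPHOBIC = "AVLIMFW"

# Two KLK motifs separated by a linker of 3 to 7 hydrophobic residues.
KLK_PATTERN = re.compile(rf"KLK[{HYDROPHOBIC}]{{3,7}}KLK")


def has_klk_linker_motif(peptide):
    """True if the peptide contains KLK, a 3-7 residue hydrophobic linker, and KLK."""
    return KLK_PATTERN.search(peptide.upper()) is not None


if __name__ == "__main__":
    # Hypothetical example peptide with a five-leucine linker.
    print(has_klk_linker_motif("KLKLLLLLKLK"))
```

Changing `HYDROPHOBIC` is the only adjustment needed to apply a different definition of hydrophobic residues.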

The pharmaceutical composition of the present invention may further comprise immunostimulatory
nucleic acid(s). Immunostimulatory nucleic acids are e.g. neutral or artificial CpG containing nucleic
acids, short stretches of nucleic acids derived from non-vertebrates or in form of short oligonucleotides
(ODNs) containing non-methylated cytosine-guanine di-nucleotides (CpG) in a certain base context (e.g.
described in WO 96/02555). Alternatively, also nucleic acids based on inosine and cytidine as e.g.
described in WO 01/93903, or deoxynucleic acids containing deoxy-inosine and/or deoxyuridine
residues (described in WO 01/93905 and PCT/EP 02/05448, incorporated herein by reference) may
preferably be used as immunostimulatory nucleic acids for the present invention. Preferably,
mixtures of different immunostimulatory nucleic acids may be used according to the present invention.
It is also within the present invention that any of the aforementioned polycationic compounds is
combined with any of the immunostimulatory nucleic acids as aforementioned. Preferably, such
combinations are according to the ones as described in WO 01/93905, WO 02/32451, WO 01/54720, WO
01/93903, WO 02/13857 and PCT/EP 02/05448 and the Austrian patent application A 1924/2001,
incorporated herein by reference.

In addition or alternatively such vaccine composition may comprise apart from the hyperimmune serum
reactive antigens and fragments thereof, and the coding nucleic acid molecules thereof according to the
present invention a neuroactive compound. Preferably, the neuroactive compound is human growth
factor as, e.g. described in WO 01/24822. Also preferably, the neuroactive compound is combined with
any of the polycationic compounds and/or immunostimulatory nucleic acids as afore-mentioned.

In a further aspect the present invention is related to a pharmaceutical composition. Such pharmaceutical
composition is, for example, the vaccine described herein. A pharmaceutical composition is also a
composition which comprises any of the following compounds or combinations thereof:
the nucleic acid molecules according to the present invention, the hyperimmune serum reactive antigens
and fragments thereof according to the present invention, the vector according to the present invention,
the cells according to the present invention, the antibody according to the present invention, the
functional nucleic acids according to the present invention and the binding peptides such as the
anticalines according to the present invention, any agonists and antagonists screened as described herein.
In connection therewith any of these compounds may be employed in combination with a non-sterile or
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical carrier suitable
for administration to a subject. Such compositions comprise, for instance, a media additive or a
therapeutically effective amount of a hyperimmune serum reactive antigen and fragments thereof of the
invention and a pharmaceutically acceptable carrier or excipient. Such carriers may include, but are not
limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The
formulation should suit the mode of administration.

The pharmaceutical compositions may be administered in any effective, convenient manner including,
for instance, administration by topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular,
subcutaneous, intranasal, intratracheal or intradermal routes among others.

In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable
composition, for example as a sterile aqueous dispersion, preferably isotonic.

Alternatively the composition may be formulated for topical application, for example in the form of 
ointments, creams, lotions, eye ointments, eye drops, ear drops, mouthwash, impregnated dressings and
sutures and aerosols, and may contain appropriate conventional additives, including, for example, 
preservatives, solvents to assist drug penetration, and emollients in ointments and creams. Such topical 
formulations may also contain compatible conventional carriers, for example cream or ointment bases, 
and ethanol or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98 % by 
weight of the formulation; more usually they will constitute up to about 80 % by weight of the 
formulation. 

In addition to the therapy described above, the compositions of this invention may be used generally as a 
wound treatment agent to prevent adhesion of bacteria to matrix proteins exposed in wound tissue and
for prophylactic use in dental treatment as an alternative to, or in conjunction with, antibiotic 
prophylaxis. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be employed to 
enhance the immune response. A suitable unit dose for vaccination is 0.05-5 µg antigen per kg of body
weight, and such dose is preferably administered 1-3 times and with an interval of 1-3 weeks. 
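
As a worked illustration of the stated unit dose range, the following Python sketch converts the per-kilogram range into an absolute amount of antigen for a given body weight. The function name and the example body weight are hypothetical; the sketch is plain arithmetic on the range given in the text and is not a dosing recommendation.

```python
# Unit dose range in micrograms of antigen per kg of body weight,
# as stated in the text.
UNIT_DOSE_UG_PER_KG = (0.05, 5.0)


def unit_dose_range_ug(body_weight_kg):
    """Return (lowest, highest) unit dose in micrograms for a body weight in kg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = UNIT_DOSE_UG_PER_KG
    return low * body_weight_kg, high * body_weight_kg


if __name__ == "__main__":
    low, high = unit_dose_range_ug(70)  # hypothetical 70 kg individual
    print(f"{low:.1f} to {high:.1f} micrograms of antigen per dose")
```

For the hypothetical body weight of 70 kg this corresponds to roughly 3.5 to 350 µg of antigen per administration.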

With the indicated dose range, no adverse toxicological effects should be observed with the compounds 
of the invention, which would preclude their administration to suitable individuals. 

In a further embodiment the present invention relates to diagnostic and pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the aforementioned
compositions of the invention. The ingredient(s) can be present in a useful amount, dosage, formulation
or combination. Associated with such container(s) can be a notice in the form prescribed by a
governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products,
reflecting approval by the agency of the manufacture, use or sale of the product for human
administration. 

In connection with the present invention any disease related use as disclosed herein such as, e. g. use of 
the pharmaceutical composition or vaccine, is particularly a disease or diseased condition which is 
caused by, linked or associated with Streptococci, more preferably, S. pneumoniae. In connection therewith 
it is to be noted that S. pneumoniae comprises several strains including those disclosed herein. A disease 
related, caused or associated with the bacterial infection to be prevented and/or treated according to the 
present invention includes, besides others, bacterial pharyngitis, otitis media, pneumonia, bacteremia,
meningitis, peritonitis and sepsis in humans. 

In a still further embodiment the present invention is related to a screening method using any of the 
hyperimmune serum reactive antigens or nucleic acids according to the present invention. Screening 
methods as such are known to the one skilled in the art and can be designed such that an agonist or an 
antagonist is screened. Preferably an antagonist is screened which in the present case inhibits or prevents
the binding of any hyperimmune serum reactive antigen and fragment thereof according to the present 
invention to an interaction partner. Such interaction partner can be a naturally occurring interaction 
partner or a non-naturally occurring interaction partner. 

The invention also provides a method of screening compounds to identify those which enhance (agonist)
or block (antagonist) the function of hyperimmune serum reactive antigens and fragments thereof or
nucleic acid molecules of the present invention, such as its interaction with a binding molecule. The
method of screening may involve high-throughput

For example, to screen for agonists or antagonists, the interaction partner of the nucleic acid molecule and
nucleic acid, respectively, according to the present invention, which may be a synthetic reaction mix, a cellular
compartment, such as a membrane, cell envelope or cell wall, or a preparation of any thereof, may be
prepared from a cell that expresses a molecule that binds to the hyperimmune serum reactive antigens
and fragments thereof of the present invention. The preparation is incubated with labelled hyperimmune
serum reactive antigens and fragments thereof in the absence or the presence of a candidate molecule,
which may be an agonist or antagonist. The ability of the candidate molecule to bind the binding
molecule is reflected in decreased binding of the labelled ligand. Molecules which bind gratuitously, i.e.,
without inducing the functional effects of the hyperimmune serum reactive antigens and fragments
thereof, are most likely to be good antagonists. Molecules that bind well and elicit functional effects that
are the same as or closely related to the hyperimmune serum reactive antigens and fragments thereof are
good agonists.

The functional effects of potential agonists and antagonists may be measured, for instance, by 
determining the activity of a reporter system following interaction of the candidate molecule with a cell 
or appropriate cell preparation, and comparing the effect with that of the hyperimmune serum reactive 
antigens and fragments thereof of the present invention or molecules that elicit the same effects as the 
hyperimmune serum reactive antigens and fragments thereof. Reporter systems that may be useful in this 
regard include but are not limited to colorimetric labelled substrate converted into product, a reporter 
gene that is responsive to changes in the functional activity of the hyperimmune serum reactive antigens 
and fragments thereof, and binding assays known in the art. 

Another example of an assay for antagonists is a competitive assay that combines the hyperimmune 
serum reactive antigens and fragments thereof of the present invention and a potential antagonist with 
membrane-bound binding molecules, recombinant binding molecules, natural substrates or ligands, or 
substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay. The 
hyperimmune serum reactive antigens and fragments thereof can be labelled, such as by radioactivity or a 
colorimetric compound, such that the number of molecules of hyperimmune serum reactive antigens and 
fragments thereof bound to a binding molecule or converted to product can be determined accurately to 
assess the effectiveness of the potential antagonist. 
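
As a minimal sketch of how such a competitive assay might be read out, the Python snippet below computes the percentage of labelled antigen binding that is lost when a candidate antagonist is present. The function name, the optional background argument and the example counts are illustrative assumptions and are not taken from the disclosure.

```python
def percent_inhibition(bound_without, bound_with, background=0.0):
    """Percent reduction in labelled-antigen signal caused by a candidate.

    bound_without: signal of labelled antigen bound with no candidate present
    bound_with:    signal of labelled antigen bound with the candidate present
    background:    non-specific signal subtracted from both (assumed, optional)
    """
    reference = bound_without - background
    if reference <= 0:
        raise ValueError("reference signal must exceed background")
    competed = bound_with - background
    return 100.0 * (reference - competed) / reference


# Hypothetical counts, for illustration only.
print(percent_inhibition(1000, 250))
```

A value near 100 would indicate that the candidate displaces almost all labelled antigen, whereas a value near 0 would indicate no competition under the chosen conditions.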

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies that bind to 
a hyperimmune serum reactive antigen and fragments thereof of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds to the same sites on a binding 
molecule without inducing functional activity of the hyperimmune serum reactive antigens and 
fragments thereof of the invention. 

Potential antagonists include a small molecule, which binds to and occupies the binding site of the 
hyperimmune serum reactive antigens and fragments thereof, thereby preventing binding to cellular 
binding molecules, such that normal biological activity is prevented. Examples of small molecules 
include but are not limited to small organic molecules, peptides or peptide-like molecules. 

Other potential antagonists include antisense molecules (see {Okano, H. et al., 1991}; 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION; CRC Press, Boca 
Raton, FL (1988), for a description of these molecules). 

Preferred potential antagonists include derivatives of the hyperimmune serum reactive antigens and 
fragments thereof of the invention. 

As used herein the activity of a hyperimmune serum reactive antigen and fragment thereof according to 
the present invention is its capability to bind to any of its interaction partners or the extent of such 
capability to bind to its or any interaction partner. 

In a particular aspect, the invention provides the use of the hyperimmune serum reactive antigens and 
fragments thereof, nucleic acid molecules or inhibitors of the invention to interfere with the initial 
physical interaction between a pathogen and mammalian host responsible for sequelae of infection. In 
particular the molecules of the invention may be used: i) in the prevention of adhesion of S. pneumoniae to 
mammalian extracellular matrix proteins at mucosal surfaces and on in-dwelling devices or to 
extracellular matrix proteins in wounds; ii) to block bacterial adhesion between mammalian extracellular 
matrix proteins and bacterial proteins which mediate tissue damage or invasion iii) or lead to evasion of 
immune defense; iv) to block the normal progression of pathogenesis in infections initiated other than by 
the implantation of in-dwelling devices or by other surgical techniques, e.g. through inhibiting nutrient 
acquisition {Brown, J. et al., 2001}. 

Each of the DNA coding sequences provided herein may be used in the discovery and development of 
antibacterial compounds. The encoded protein upon expression can be used as a target for the screening 
of antibacterial drugs. Additionally, the DNA sequences encoding the amino terminal regions of the 
encoded protein or Shine-Dalgarno or other translation facilitating sequences of the respective mRNA can 
be used to construct antisense sequences to control the expression of the coding sequence of interest. 

The antagonists and agonists may be employed, for instance, to inhibit diseases arising from infection 
with Streptococcus, especially S. pneumoniae, such as sepsis. 

In a still further aspect the present invention is related to an affinity device. Such affinity device comprises 
at least a support material and any of the hyperimmune serum reactive antigens and fragments thereof 
according to the present invention, which is attached to the support material. Because of the specificity of 
the hyperimmune serum reactive antigens and fragments thereof according to the present invention for 
their target cells or target molecules or their interaction partners, the hyperimmune serum reactive 
antigens and fragments thereof allow a selective removal of their interaction partner(s) from any kind of 
sample applied to the support material provided that the conditions for binding are met. The sample may 
be a biological or medical sample, including but not limited to, fermentation broth, cell debris, cell 
preparation, tissue preparation, organ preparation, blood, urine, lymph liquid, liquor and the like. 

The hyperimmune serum reactive antigens and fragments thereof may be attached to the matrix in a 
covalent or non-covalent manner. Suitable support material is known to the one skilled in the art and can 
be selected from the group comprising cellulose, silicon, glass, aluminium, paramagnetic beads, starch 
and dextran. 

The present invention is further illustrated by the following figures, examples and the sequence listing, 
from which further features, embodiments and advantages may be taken. It is to be understood that the 
present examples are given by way of illustration only and not by way of limitation of the disclosure. 

In connection with the present invention 

Figure 1 shows the characterization of S. pneumoniae specific human sera. 

Figure 2 shows the characterization of the small fragment genomic library, LSPn-70, from Streptococcus 
pneumoniae serotype 4. 

Figure 3 shows the selection of bacterial cells by MACS using biotinylated human IgGs. 

Figure 4 shows an example for the gene distribution study with the identified antigens. 

Figure 5 shows examples of changes in epitope-specific antibody levels in the different age groups and 
during the course of pneumococcal disease. 

Figure 6 shows examples for cell surface staining with epitope-specific antisera by flow cytometry. 

Figure 7 shows the determination of bactericidal activity of antibodies induced by selected epitopes in an 
in vitro assay. 

Figure 8 shows the protective effect of active immunization with selected S. pneumoniae antigens in a 
murine lethality sepsis model. 

Figure 9 shows the protective effect of passive immunization with sera generated with selected S. 
pneumoniae antigens in a murine lethality sepsis model. 

Figure 10 shows the identification of the protective domain within the SP2216 antigen. 

Figure 11 shows that antibodies induced by protective antigens are cross-reactive with the different S. 
pneumoniae serotypes. 

Figure 12 shows the alignment of amino acid sequences of natural SP2216 variants. 

Figure 13 shows the alignment of amino acid sequences of natural SP1732 variants. 

Figure 14 shows the alignment of amino acid sequences of natural SP2190 variants. 

Table 1 shows the summary of all screens performed with genomic S. pneumoniae libraries and human 
serum. 

Table 2 shows the summary of epitope serology analysis with human sera. 

Table 3 shows the summary of the gene distribution analysis for the identified antigens in 50 S. 
pneumoniae strains. 

Table 4 shows the summary of the surface staining and bactericidal activity measurements. 

The figures referred to in the specification are described in more detail in the following. 

Figure 1 shows the characterization of human sera for anti-S. pneumoniae antibodies as measured by 
immune assays. Total anti-S. pneumoniae IgG and IgA antibody levels were measured by standard ELISA 
using total bacterial lysates or culture supernatant fractions prepared from S. pneumoniae serotype 4 capsule 
negative mutant strain as coating antigens. 97 serum samples from convalescing patients with invasive 
diseases and 50 sera from healthy adults without nasopharyngeal carriage of S. pneumoniae were analysed 
at three different serum dilutions. Results of representative experiments are shown with (A) patients' sera 
with bacterial lysate and (B) healthy adult sera with culture supernatant proteins. Data are expressed as 
ELISA units calculated from absorbance at 405nm at a serum dilution in the linear range of detection 
(10,000X for IgA, 50,000X for IgG). 2x5 sera from both donor groups were selected and pooled for antigen 
identification by bacterial surface display. Selected sera included in the two patient (PSPn3-IgG,-IgA and 
PSPn7-IgG) and two healthy pools (NSPn4-IgG,-IgA and NSPn5-IgG) are indicated by circles. (C) 
Immunoblot analysis was performed on sera pre-selected by ELISA in order to ensure multiple immune 
reactivity with protein antigens. Results of a representative experiment using total bacterial lysate 
prepared from S. pneumoniae serotype 4 capsule negative mutant strain and selected patients' sera at 
5,000X dilution are shown. Not selected, low titer sera were included as negative controls. Mw: molecular 
weight markers. (D) Surface staining of S. pneumoniae serotype 4 capsule negative mutant strain was 
performed by FACS to compare antibody binding to surface located antigens. Human sera were used at 
different concentrations (0.5-5%). Representative data are shown with patients' sera used at 0.5% final 
concentration. Signal was detected with FITC-labeled anti-human IgG Fab and analysed with the 
computer program CELLQuest. (E) Correlation between IgG titers measured by ELISA using total 
bacterial lysates and surface staining of whole living S. pneumoniae with serum IgGs is shown. IgG titer is 
expressed as ELISA units, while surface staining is expressed as mean fluorescence of stained bacteria 
calculated by the computer program CELLQuest. 

Figure 2 (A) shows the fragment size distribution of the Streptococcus pneumoniae type 4 small fragment 
genomic library, LSPn-70. After sequencing 609 randomly selected clones, sequences were trimmed to 
eliminate vector residues and the number of clones with various genomic fragment sizes was plotted. 
(B) shows a graphic illustration of the distribution of the same set of randomly sequenced clones of LSPn-70 
over the S. pneumoniae chromosome (according to the TIGR4 genome data). Circles indicate matching 
sequences to annotated ORFs and rectangles represent fully matched clones to non-coding chromosomal 
sequences in +/+ or +/- orientation. Diamonds position all clones with chimeric sequences. Numeric 
distances in base pairs are indicated over the circular genome for orientation. Partitioning of various 
clone sets within the library is given in numbers and percentage at the bottom of the figure. 

Figure 3 (A) shows the MACS selection with biotinylated human IgGs. The LSPn-70 library in pMAL9.1 
was screened with 10 µg biotinylated IgG (PSPn3-IgG, purified from human serum). As negative control, 
no serum was added to the library cells for screening. Number of cells selected after the 1st and 2nd 
elution are shown for each selection round (upper and lower panel, respectively). (B) shows the reactivity 
of specific clones (1-26) selected by bacterial surface display as analysed by immunoblot analysis with the 
human serum IgG pool (PSPn7-IgG, 4 µg/µl) used for selection by MACS at a dilution of 1:3,000. As a 
loading control the same blot was also analysed with antibodies directed against the platform protein 
LamB at a dilution of 1:5,000 of hyperimmune rabbit serum. LB, Extract from a clone expressing LamB 
without foreign peptide insert. 

Figure 4 (A) shows the representation of different serotypes of S. pneumoniae clinical isolates analysed for 
the gene distribution study. (B) shows the PCR analysis for the gene distribution of SP1604 with the 
respective oligonucleotides. The predicted size of the PCR fragments is 470 bp. 1-50, S. pneumoniae strains, 
clinical isolates as listed under A; -, no genomic DNA added; +, genomic DNA from S. pneumoniae 
serotype 4, which served as template for library construction. 

Figure 5 shows the ELISA measurement of epitope-specific human serum IgG antibody levels during 
pneumococcal disease. Three serum samples were collected longitudinally from patients with invasive 
pneumococcal disease, before disease occurred (pre), in the acute and convalescent phases. 
Representative experiments are shown with two sets of sera from two different patients, (A) P1147 and 
(B) P1150 reacted with peptides representing the identified antigens SP0069, SP0082, SP0117, SP1175, 
SP1937, SP2190 and SP2216, as indicated. Biotin-labeled peptides were reacted with human serum 
samples at 200X and 1,000X dilutions and data are expressed as ELISA units. 

Figure 6 shows the detection of specific antibody binding on the cell surface of Streptococcus pneumoniae 
by flow cytometry. In Figure 6A preimmune mouse sera and polyclonal sera raised against S. pneumoniae 
serotype 4 lysate were incubated with S. pneumoniae strain serotype 4 and analysed by flow cytometry. 
Control shows the level of non-specific binding of the secondary antibody to the surface of S. pneumoniae 
cells. The histograms in Figure 6B indicate the increased fluorescence due to specific binding of 
anti-SP2216, anti-SP0117, anti-SP0454 and anti-CRF1992 antibodies in comparison to the control sera against 
the platform protein LamB. 

Figure 7 shows the bactericidal activity of epitope specific antibodies as determined in an in vitro killing 
assay. The killing activity of immune sera is measured in parallel with and calculated relative to the 
appropriate control sera. Data are expressed as percentage of killing, that is the reduction in bacterial cfu 
numbers as a consequence of the presence of antibodies in hyperimmune (HI) polyclonal mouse sera 
generated with S. pneumoniae lysate (A), in immune sera generated with SP0117 epitopes expressed in the 
LamB platform protein (B), and in mouse immune sera generated with SP1287 epitopes expressed in the 
FhuA platform protein (C). The control sera represent preimmune sera (PI), sera induced with LamB or 
FhuA expressing E. coli clones without S. pneumoniae-derived epitopes. S. pneumoniae serotype 4 cells were 
incubated with mouse phagocytic cells for 60 min, and surviving bacteria were quantified by counting 
cfus after plating on blood agar. 
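
The percentage of killing described above can be illustrated with a short Python sketch. It assumes the simplest reading of the legend, namely that killing is the relative reduction in cfu counts in the presence of immune serum compared with the matching control serum; the function name and the example counts are hypothetical.

```python
def percent_killing(cfu_control, cfu_immune):
    """Relative reduction in surviving bacteria, in percent.

    cfu_control: colonies counted after incubation with control serum
    cfu_immune:  colonies counted after incubation with immune serum
    """
    if cfu_control <= 0:
        raise ValueError("control cfu count must be positive")
    return 100.0 * (cfu_control - cfu_immune) / cfu_control


# Hypothetical plate counts, for illustration only.
print(percent_killing(200, 50))
```

Under this reading, a negative value would mean that more bacteria survived with the immune serum than with the control serum.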

Figure 8 shows the protection achieved by active immunization with selected S. pneumoniae antigens in a 
mouse lethality model. C3H mice (10 in each test group) were immunized with recombinant antigens 
cloned from a serotype 4 S. pneumoniae strain and challenged with serotype 6B strain. Survival was 
monitored for 2 to 3 weeks post-challenge. A: Mice were immunized with SP0368, SP0667, SP2190 and 
SP2216 antigens and challenged intravenously with 10* cfu bacteria. Nonimmunized mice were used as 
negative controls, while PspA (SP0117) served as positive control. B: Mice were immunized with SP2190 
and SP2216 antigens and challenged intraperitoneally with 10^ cfu bacteria. Mice injected with PBS or 
mock immunized with the adjuvants only (CFA/IFA) were used as negative controls, while PspA 
(SP0117) served as positive control. C: Mice were immunized with SP0498 and SP1732 antigens and 
challenged intraperitoneally with 10^ cfu bacteria. Mice injected with PBS were used as negative controls, 
while PspA (SP0117) served as positive control. 

Figure 9 shows the protection achieved by passive immunization with hyperimmune mouse sera 
generated with selected S. pneumoniae antigens in a mouse lethality model. C3H mice (10 in each test 
group) were given mouse sera intraperitoneally 2 hrs before intraperitoneal challenge with 10^5 cfu S. 
pneumoniae serotype 6B bacteria. Survival was monitored for 3 weeks post-challenge. 150 µl immune sera 
generated with SP2190 or SP2216 were given and supplemented with 150 µl serum from naive mice, 
except for mice receiving 100 µl each of anti-SP2190, anti-SP2216 immune sera and 100 µl serum from 
naive mice. Negative controls were treated either with 300 µl sera from PBS injected, noninjected (naive) 
or nonimmune CFA/IFA injected mice. 

Figure 10 shows the identification of the protective domain within the SP2216 antigen. A: Schematic 
representation of the SP2216 antigen indicating the two subdomains predicted by in silico (structural 
prediction) analysis and the localization of epitopes identified by bacterial surface display (grey bars and 
arrows). B: C3H mice (10 in each test group) were immunized with recombinant SP2216 antigens: 
full-length, N-terminal or C-terminal domains and challenged with S. pneumoniae serotype 6B strain given 
intraperitoneally 10^ cfus. Survival was monitored for 2 to 3 weeks post-challenge. Nonimmunized 
(CFA/IFA adjuvant injected) mice were used as negative controls, while PspA (SP0117) served as positive 
control. 

Figure 11 shows the cross-reactivity of antibodies by analysing different S. pneumoniae serotypes. 
Immunoblot analysis was performed with bacterial lysates prepared from 60 clinical isolates of S. 
pneumoniae representing 48 different serotypes and using sera generated with SP1732, SP2190 and SP2216 
recombinant antigens cloned from a serotype 4 strain in order to test the cross-reactivity of antibodies. 
Results with seven different serotypes (Lanes 1-7) are shown as representative data taken from the 
complete analysis. Mw: molecular weight markers. 

Figure 12 shows the amino acid exchanges detected in natural SP2216 variants expressed in different 
clinical isolates of S. pneumoniae. The SP2216 gene from 47 different clinical isolates representing 47 
different S. pneumoniae serotypes was analysed by DNA sequencing. The translated amino acid 
sequences are shown for those serotypes where amino acid exchanges were detected relative to the 
published TIGR4 genome sequences. 

Figure 13 shows the amino acid exchanges detected in natural SP1732 variants expressed in different 
clinical isolates of S. pneumoniae. The SP1732 gene from six different clinical isolates representing six 
major S. pneumoniae serotypes (4, 6B, 9V, 18C, 19F, 23F) was analysed by DNA sequencing. The 
translated amino acid sequences are shown for those serotypes where amino acid exchanges were 
detected relative to the published TIGR4 (serotype 4) genome sequences. 

Figure 14 shows the amino acid exchanges detected in natural SP2190 variants expressed in different 
clinical isolates of S. pneumoniae. The SP2190 gene from seven different clinical isolates representing seven 
major S. pneumoniae serotypes (4, 6B, 9V, 14, 18C, 19F, 23F) was analysed by DNA sequencing. The 
translated amino acid sequences are shown for those serotypes where amino acid exchanges were 
detected relative to the published TIGR4 (serotype 4) genome sequences. Due to missing sequence 
information from the middle part of some of the genes, the N-terminal and C-terminal aa alignments are 
shown separately. A: N-terminal amino acid sequences; B: C-terminal amino acid sequences. 

Table 1: Immunogenic proteins identified by bacterial surface display. 

A, 300bp library in fhuA with NSPn4-IgA (362), B, 300bp library in fhuA with NSPn4-IgG (832), C, 300bp 
library in fhuA with NSPn5-IgG (872), D, 300bp library in fhuA with PSPn3-IgA (361), E, 300bp library in 
fhuA with PSPn3-IgG (575), F, 300bp library in fhuA with PSPn7-IgG (795), G, 70bp library in lamB with 
NSPn4-IgA (1043), H, 70bp library in lamB with NSPn4-IgG (929), I, 70bp library in lamB with NSPn5-IgG 
(527), K, 70bp library in lamB with PSPn3-IgA (1121), L, 70bp library in lamB with PSPn3-IgG (1242), M, 
70bp library in lamB with PSPn7-IgG (514); *, prediction of antigenic sequences longer than 5 amino acids 
was performed with the program ANTIGENIC {Kolaskar, A. et al., 1990}. 

Table 2: Epitope serology with human sera. 

Immune reactivity of individual synthetic peptides representing selected epitopes with individual human 
sera is shown. Extent of reactivity is pattern/grey coded; white: - (<50U), light grey: + (50-119U), dark 
grey: ++ (120-199U), black: +++ (200-500U) and vertically crossed: ++++ (>500U). ELISA units (U) are 
calculated from OD405nm readings and the serum dilution after correction for background. S stands for 
score, calculated as the sum of all immune reactivities (addition of the number of all +): - =0; + =1; 
++ =2; +++ =3 and ++++ =4. P1 to P13 sera are measured to be high titer and are from patients with 
invasive pneumococcal diseases and N1 to N10 sera are from healthy adults with high anti-S. pneumoniae 
titers. Location of synthetic peptides within the antigenic ORFs according to the genome annotation of 
TIGR4 strain is given in columns from and to, indicating the first and last amino acid residues, 
respectively. Peptide names: SP0117.1-7, present in annotated ORF SP0117; ARF0408.1, potential novel 
ORF in alternative reading-frame of SP0408; CRF0129.1, potential novel ORF on complement of SP0129. 
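
The grading and scoring scheme of Table 2 can be sketched in Python as follows. This is an illustrative reading of the legend only: the function names are hypothetical, the lower bound of the +++ band is taken as 200U because the preceding band ends at 199U, and values falling between the stated integer bands are assigned to the lower band.

```python
def reactivity_grade(units):
    """Map ELISA units (U) to a grade: 0 (-), 1 (+), 2 (++), 3 (+++), 4 (++++)."""
    if units > 500:
        return 4
    if units >= 200:
        return 3
    if units >= 120:
        return 2
    if units >= 50:
        return 1
    return 0


def grade_symbol(grade):
    """Render a grade in the notation used in the table legend."""
    return "-" if grade == 0 else "+" * grade


def peptide_score(units_per_serum):
    """Score S: sum of the grades of one peptide across all tested sera."""
    return sum(reactivity_grade(u) for u in units_per_serum)


# Hypothetical ELISA units of one peptide with five sera.
readings = [10, 60, 150, 300, 800]
print([grade_symbol(reactivity_grade(u)) for u in readings])
print(peptide_score(readings))
```

Summing the grades is equivalent to counting all + symbols across the sera, which is how the legend describes the score.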

Table 3: Gene distribution in S. pneumoniae strains. 

Fifty S. pneumoniae strains as shown in Figure 4A were tested by PCR with oligonucleotides specific for 
the genes encoding relevant antigens. One selected PCR fragment was sequenced in 
order to confirm the amplification of the correct DNA fragment. *, number of amino acid substitutions in 
a serotype 14 strain as compared to S. pneumoniae TIGR4 (serotype 4). #, alternative strain used for 
sequencing, because gene was not present in the serotype 14 strain. 

Table 4: Surface location of antigenic epitopes and the functionality of the epitope-specific antibodies. 

45 S. pneumoniae antigens were tested for surface localization in the way described and presented in 
Figure 6 by using mouse sera generated by immunization with E. coli clones harboring plasmids 
encoding the platform proteins LamB or FhuA fused to a S. pneumoniae peptide. Data are summarized in 
the column labeled FACS. The very same immune reagents were used in an in vitro killing assay, as 
shown in Figure 7 for the examples, and presented for all antigens tested positive by FACS in column PK 
(phagocytic killing). -: negative result, +: not consistently positive in all assays performed, ++ and +++ are 
consistently positive relative to control reagents. 

EXAMPLES 

Example 1: Characterization and selection of human sera based on anti-S. pneumoniae antibodies, 
preparation of antibody screening reagents 

Experimental procedures 

Enzyme linked immune assay (ELISA). 

ELISA plates (Maxisorb, Millipore) were coated with 5-10 µg/ml total protein diluted in coating buffer 
(0.1 M sodium carbonate pH 9.2). Three dilutions of sera (2,000X, 10,000X, 50,000X) were made in 
PBS-BSA. Highly specific Horse Radish Peroxidase (HRP)-conjugated anti-human IgG or anti-human IgA 
secondary antibodies (Southern Biotech) were used according to the manufacturers' recommendations 
(dilution: 1,000x). Antigen-antibody complexes were quantified by measuring the conversion of the 
substrate (ABTS) to colored product based on OD405nm readings by automatic ELISA reader (TECAN 
SUNRISE). 

Preparation of bacterial antigen extracts 

Total bacterial lysate: Bacteria were grown overnight in THB (Todd-Hewitt Broth) and lysed by repeated 
freeze-thaw cycles: incubation on dry ice/ethanol-mixture until frozen (1 min), then thawed at 37°C (5 
min): repeated 3 times. This was followed by sonication and collection of supernatant by centrifugation 
(3,500 rpm, 15 min, 4°C). 

Culture supernatant: After removal of bacteria by centrifugation, the supernatant of overnight grown 
bacterial cultures was precipitated with ice-cold ethanol by mixing 1 part supernatant with 3 parts abs. 
ethanol and incubated overnight at -20°C. Precipitates were collected by centrifugation (2,600 g, for 15 
min). Dry pellets were dissolved either in PBS for ELISA, or in urea and SDS-sample buffer for SDS- 
PAGE and immunoblotting. The protein concentration of samples was determined by Bradford assay. 

Immunoblotting 

Total bacterial lysate and culture supernatant samples were prepared from in vitro grown S. pneumoniae 
serotype 4 uncapsulated mutant strain. 10 to 25 µg total protein/lane was separated by SDS-PAGE using 
the BioRad Mini-Protean 3 Cell electrophoresis system and proteins transferred to nitrocellulose 
membrane (ECL, Amersham Pharmacia). After overnight blocking in 5% milk, human sera were added at 
2,000x dilution, and HRPO labeled anti-human IgG was used for detection. 

Surface staining of bacteria 

Flow cytometric analysis was carried out as follows. S. pneumoniae serotype 4 uncapsulated mutant strain 
was grown in Todd-Hewitt broth overnight until early stationary phase. Cells were collected and washed 
twice in Hanks Balanced Salt Solution (HBSS) and the cell density was adjusted to approximately 1 X 10^ 
CFU in 100 µl HBSS with 0.5% BSA based on OD600 nm readings. After incubation with human sera at 
0.5 and 2% final concentration for 60 min at 4°C, unbound antibodies were washed away by 
centrifugation in excess HBSS, 0.5% BSA. For detection fluorescein (FITC) labeled secondary goat 
anti-human IgG (F(ab')2 fragment specific) was incubated with the cells at 4°C for 30 min. After washing the 
cells, cells were fixed with 2% paraformaldehyde. Surface staining antibodies were detected using a 
Becton Dickinson FACScan flow cytometer and data further analyzed with the computer program 
CELLQuest. 

Purification of antibodies for genomic screening. Five sera from both the patient and the healthy group were 
selected based on the overall anti-streptococcal titers for a serum pool used in the screening procedure. 
Antibodies against E. coli proteins were removed by incubating the heat-inactivated sera with whole 
E. coli cells (DH5alpha, transformed with pHIE11, grown under the same condition as used for bacterial 
surface display). Highly enriched preparations of IgGs from the pooled, depleted sera were generated by 
protein G affinity chromatography, according to the manufacturer's instructions (UltraLink Immobilized 
Protein G, Pierce). IgA antibodies were purified also by affinity chromatography using biotin-labeled 
anti-human IgA (Southern Biotech) immobilized on Streptavidin-agarose (GIBCO BRL). The efficiency of 
depletion and purification was checked by SDS-PAGE, Western blotting, ELISA and protein 
concentration measurements. 

Results 

The antibodies produced against S. pneumoniae by the human immune system and present in human sera 
are indicative of the in vivo expression of the antigenic proteins and their immunogenicity. These 
molecules are essential for the identification of individual antigens in the approach as described in the 
present invention, which is based on the interaction of the specific anti-streptococcal antibodies and the 
corresponding S. pneumoniae peptides or proteins. To gain access to relevant antibody repertoires, human 
sera were collected from 

I. convalescent patients with invasive S. pneumoniae infections, such as pneumonia, bacteraemia 
and meningitis (S. pneumoniae was shown to be the causative agent by medical microbiological tests), 

II. healthy adults without carriage at the time of sampling. S. pneumoniae colonization and 
infections are common, and antibodies are present as a consequence of natural immunization from 
previous encounters. 

97 serum samples from patients and 50 sera from healthy adults were characterized for anti-S. pneumoniae 
antibodies by a series of immune assays. Primary characterization was done by ELISA using two 
different antigen preparations, namely total bacterial extract and culture supernatant proteins prepared 
from S. pneumoniae serotype 4 uncapsulated mutant strain. It is an important aspect that we analysed an 
uncapsulated strain, since this avoided the reactivities coming from serotype specific abundant 
anti-capsular polysaccharide antibodies. 

Recently it was reported that not only IgG, but also IgA serum antibodies can be recognized by the FcRIII 
receptors of PMNs and promote opsonization {Phillips-Quagliata, J. et al., 2000}; {Shibuya, A. et al., 2000}. 
The primary role of IgA antibodies is neutralization, mainly at the mucosal surface. The level of serum 
IgA reflects the quality, quantity and specificity of the dimeric secretory IgA. For that reason the serum 
collection was not only analyzed for anti-streptococcal IgG, but also for IgA levels. In the ELISA assays 
highly specific secondary reagents were used to detect antibodies from the high affinity types, such as 
IgG and IgA, while IgM was avoided. Production of IgM antibodies occurs during the primary adaptive 
humoral response, and results in low affinity antibodies, while IgG and IgA antibodies have already 
undergone affinity maturation, and are more valuable in fighting or preventing disease. Antibody titers 
were compared at given dilutions where the response was linear (Fig. 1A and 1B). Sera were ranked 
based on the IgG and IgA reactivity against the two complex antigenic mixtures, and the highest ones 
were selected for further testing by immunoblotting. This analysis confirmed a high antibody reactivity 
of the pre-selected sera against multiple pneumococcal proteins, especially when compared to not 
selected, low-titer sera (Fig. 1C). ELISA ranking of sera also correlated very well with surface staining of 
the same S. pneumoniae strain (Fig. 1D and 1E), suggesting that the majority of the antibodies detected by 
ELISA corresponded to surface antigens. This extensive antibody characterization approach has led to the 
unambiguous identification of anti-pneumococcal hyperimmune sera. 

Selected sera, 2x5 from both the patient and healthy donor groups, were pooled to further enrich for
abundant antibodies, while still having a representation of antibody repertoires of different individuals.
IgG and IgA antibodies were purified from pooled sera by affinity chromatography and depleted of E.
coli-reactive antibodies to avoid background in the bacterial surface display screen.

Example 2: Generation of highly random, frame-selected, small-fragment genomic DNA libraries of
Streptococcus pneumoniae

Experimental procedures 

Preparation of streptococcal genomic DNA. 50 ml Todd-Hewitt Broth medium was inoculated with S.
pneumoniae serotype 4 (clinical isolate, typed with conventional serotyping) bacteria from a frozen stab
and grown with aeration and shaking for 18 h at 37°C. The culture was then harvested, centrifuged at
1,600x g for 15 min and the supernatant was removed. Bacterial pellets were washed 3 x with PBS and
carefully re-suspended in 0.5 ml of Lysozyme solution (100 mg/ml). 0.1 ml of 10 mg/ml heat treated
RNase A and 20 U of RNase T1 were added, mixed carefully and the solution was incubated for 1 h at
37°C. Following the addition of 0^ ml of 20 % SDS solution and 0.1 ml of Proteinase K (10 mg/ml) the
tube was incubated overnight at 55°C. 1/3 volume of saturated NaCl was then added and the solution
was incubated for 20 min at 4°C. The extract was pelleted in a microfuge (13,000 rpm) and the
supernatant transferred into a new tube. The solution was extracted with PhOH/CHCl3/IAA (25:24:1) and
with CHCl3/IAA (24:1). DNA was precipitated at room temperature by adding 0.6x volume of
Isopropanol, spooled from the solution with a sterile Pasteur pipette and transferred into tubes
containing 80% ice-cold ethanol. DNA was recovered by centrifuging the precipitates at 10-12,000x g,
then air-dried and dissolved in ddH2O.

Preparation of small genomic DNA fragments. Genomic DNA was mechanically sheared into
fragments ranging in size between 150 and 300 bp using a cup-horn sonicator (Bandelin Sonoplus UV
2200 sonicator equipped with a BBS cup horn, 10 sec. pulses at 100 % power output) or into fragments of
size between 50 and 70 bp by mild DNase I treatment (Novagen). It was observed that sonication yielded
a much tighter fragment size distribution when breaking the DNA into fragments of the 150-300 bp size
range. However, despite extensive exposure of the DNA to ultrasonic wave-induced hydromechanical
shearing force, a further decrease in fragment size could not be efficiently and reproducibly achieved.
Therefore, fragments of 50 to 70 bp in size were obtained by mild DNase I treatment using Novagen's
shotgun cleavage kit. A 1:20 dilution of DNase I provided with the kit was prepared and the digestion
was performed in the presence of MnCl2 in a 60 µl volume at 20°C for 5 min to ensure double-stranded
cleavage by the enzyme. Reactions were stopped with 2 µl of 0^ M EDTA and the fragmentation
efficiency was evaluated on a 2% TAE-agarose gel. This treatment resulted in total fragmentation of
genomic DNA into near 50-70 bp fragments. Fragments were then blunt-ended twice using T4 DNA
Polymerase in the presence of 100 each of dNTPs to ensure efficient flushing of the ends. Fragments
were used immediately in ligation reactions or frozen at -20°C for subsequent use.

Description of the vectors. The vector pMAL4.31 was constructed on a pASK-IBA backbone {Skerra, A.,
1994} with the beta-lactamase (bla) gene exchanged with the Kanamycin resistance gene. In addition the
bla gene was cloned into the multiple cloning site. The sequence encoding mature beta-lactamase is
preceded by the leader peptide sequence of ompA to allow efficient secretion across the cytoplasmic
membrane. Furthermore a sequence encoding the first 12 amino acids (spacer sequence) of mature
beta-lactamase follows the ompA leader peptide sequence to avoid fusion of sequences immediately after the
leader peptidase cleavage site, since e.g. clusters of positively charged amino acids in this region would
decrease or abolish translocation across the cytoplasmic membrane {Kajava, A. et al., 2000}. A SmaI
restriction site serves for library insertion. An upstream FseI site and a downstream NotI site, which were
used for recovery of the selected fragment, flank the SmaI site. The three restriction sites are inserted after
the sequence encoding the 12 amino acid spacer sequence in such a way that the bla gene is transcribed in
the -1 reading frame, resulting in a stop codon 15 bp after the NotI site. A +1 bp insertion restores the bla
ORF so that beta-lactamase protein is produced with a consequent gain of Ampicillin resistance.

The vector pMAL9.1 was constructed by cloning the lamB gene into the multiple cloning site of pEH1
{Hashemzadeh-Bonehi, L. et al., 1998}. Subsequently, a sequence was inserted in lamB after amino acid
154, containing the restriction sites FseI, SmaI and NotI. The reading frame for this insertion was
constructed in such a way that transfer of frame-selected DNA fragments excised by digestion with FseI
and NotI from plasmid pMAL4.31 yields a continuous reading frame of lamB and the respective insert.

The vector pMAL10.1 was constructed by cloning the btuB gene into the multiple cloning site of pEH1.
Subsequently, a sequence was inserted in btuB after amino acid 236, containing the restriction sites FseI,
XbaI and NotI. The reading frame for this insertion was chosen in a way that transfer of frame-selected
DNA fragments excised by digestion with FseI and NotI from plasmid pMAL4.31 yields a continuous
reading frame of btuB and the respective insert.

The vector pHIE11 was constructed by cloning the fhuA gene into the multiple cloning site of pEH1.
Thereafter, a sequence was inserted in fhuA after amino acid 405, containing the restriction sites FseI, XbaI
and NotI. The reading frame for this insertion was chosen in a way that transfer of frame-selected DNA
fragments excised by digestion with FseI and NotI from plasmid pMAL4.31 yields a continuous reading
frame of fhuA and the respective insert.

Cloning and evaluation of the library for frame selection. Genomic S. pneumoniae DNA fragments were ligated
into the SmaI site of the vector pMAL4.31. Recombinant DNA was electroporated into DH10B
electrocompetent E. coli cells (GIBCO BRL) and transformants plated on LB-agar supplemented with
Kanamycin (50 µg/ml) and Ampicillin (50 µg/ml). Plates were incubated overnight at 37°C and colonies
collected for large scale DNA extraction. A representative plate was stored and saved for collecting
colonies for colony PCR analysis and large-scale sequencing. A simple colony PCR assay was used to
initially determine the rough fragment size distribution as well as insertion efficiency. Sequencing
data were used to evaluate the precise fragment size, junction intactness at the insertion site and the frame
selection accuracy (3n+1 rule).
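
The 3n+1 check on sequenced inserts can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the example insert lengths are hypothetical and are not taken from the sequencing data described here. It relies solely on the fact, stated above, that a +1 bp insertion restores the bla reading frame, so an insert length that leaves remainder 1 on division by 3 is in frame.

```python
def is_frame_selected(insert_length_bp):
    """Return True if the insert length satisfies the 3n+1 rule."""
    return insert_length_bp % 3 == 1


def summarize_library(insert_lengths):
    """Summarize frame-selection accuracy and mean size for sequenced inserts."""
    if not insert_lengths:
        raise ValueError("no insert lengths supplied")
    in_frame = sum(1 for n in insert_lengths if is_frame_selected(n))
    return {
        "clones": len(insert_lengths),
        "in_frame_fraction": in_frame / len(insert_lengths),
        "mean_size_bp": sum(insert_lengths) / len(insert_lengths),
    }


# Hypothetical insert lengths (bp) for five sequenced clones.
example_lengths = [52, 49, 70, 31, 51]
summary = summarize_library(example_lengths)
```

A clone whose insert fails the check (51 bp in the example) would point to a frame-selection escape or a damaged junction and would merit inspection of the raw sequence.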

Cloning and evaluation of the library for bacterial surface display. Genomic DNA fragments were excised from
the pMAL4.31 vector containing the S. pneumoniae library with the restriction enzymes FseI and NotI. The
entire population of fragments was then transferred into plasmids pMAL9.1 (LamB) or pHIE11 (FhuA),
which had been digested with FseI and NotI. Using these two restriction enzymes, which recognise an 8
bp GC rich sequence, the reading frame that was selected in the pMAL4.31 vector is maintained in each of
the platform vectors. The plasmid library was then transformed into E. coli DH5alpha cells by
electroporation. Cells were plated onto large LB-agar plates supplemented with 50 µg/ml Kanamycin and
grown overnight at 37°C at a density yielding clearly visible single colonies. Cells were then scraped off
the surface of these plates, washed with fresh LB medium and stored in aliquots for library screening at -

Results 

Libraries for frame selection. Two libraries (LSPn70 and LSPn300) were generated in the pMAL4.31 vector
with sizes of approximately 70 and 300 bp, respectively. For each library, ligation and subsequent
transformation of approximately 1 µg of pMAL4.31 plasmid DNA and 50 ng of fragmented genomic S.
pneumoniae DNA yielded 4x 10^ to 2x 10^ clones after frame selection. To assess the randomness of the
libraries, approximately 600 randomly chosen clones of LSPn70 were sequenced. The bioinformatic
analysis showed that of these clones only very few were present more than once. Furthermore, it was
shown that 90% of the clones fell in the size range between 25 and 100 bp with an average size of 52 bp
(Figure 2). Almost all sequences followed the 3n+1 rule, showing that the clones were properly frame
selected.

Bacterial surface display libraries. The display of peptides on the surface of E. coli required the transfer of the
inserts from the LSPn libraries from the frame selection vector pMAL4.31 to the display plasmids
pMAL9.1 (LamB) or pHIE11 (FhuA). Genomic DNA fragments were excised by FseI and NotI restriction,
and ligation of 5 ng inserts with 0.1 µg plasmid DNA and subsequent transformation into DH5alpha cells
resulted in 2-5x 10^ clones. The clones were scraped off the LB plates and frozen without further
amplification.

Example 3: Identification of highly immunogenic peptide sequences from S. pneumoniae using 
bacterial surface displayed genomic libraries and human serum 

Experimental procedures 

MACS screening. Approximately 25x 10^ cells from a given library were grown in 5 ml LB medium
supplemented with 50 µg/ml Kanamycin for 2 h at 37°C. Expression was induced by the addition of 1
mM IPTG for 30 min. Cells were washed twice with fresh LB medium and approximately 2x 10^7 cells
re-suspended in 100 µl LB medium and transferred to an Eppendorf tube.

10 µg of biotinylated, human IgGs purified from serum was added to the cells and the suspension
incubated overnight at 4°C with gentle shaking. 900 µl of LB medium was added, the suspension mixed
and subsequently centrifuged for 10 min at 6,000 rpm at 4°C. (For IgA screens, 10 µg of purified IgAs
were used and these captured with biotinylated anti-human-IgG secondary antibodies.) Cells were
washed once with 1 ml LB and then re-suspended in 100 µl LB medium. 10 µl of MACS microbeads
coupled to streptavidin (Miltenyi Biotech, Germany) were added and the incubation continued for 20 min
at 4°C. Thereafter 900 µl of LB medium was added and the MACS microbead cell suspension was loaded
onto the equilibrated MS column (Miltenyi Biotech, Germany), which was fixed to the magnet. (The MS
columns were equilibrated by washing once with 1 ml 70% EtOH and twice with 2 ml LB medium.)

The column was then washed three times with 3 ml LB medium. After removal of the magnet, cells were
eluted by washing with 2 ml LB medium. After washing the column with 3 ml LB medium, the 2 ml
eluate was loaded a second time on the same column and the washing and elution process repeated. The
loading, washing and elution process was performed a third time, resulting in a final eluate of 2 ml.

A second round of screening was performed as follows. The cells from the final eluate were collected by
centrifugation and re-suspended in 1 ml LB medium supplemented with 50 µg/ml Kanamycin. The
culture was incubated at 37°C for 90 min and then induced with 1 mM IPTG for 30 min. Cells were
subsequently collected, washed once with 1 ml LB medium and suspended in 10 µl LB medium. 10 µg of
human, biotinylated IgGs were added again and the suspension incubated overnight at 4°C with gentle
shaking. All further steps were exactly the same as in the first selection round. Cells selected after two
rounds of selection were plated onto LB-agar plates supplemented with 50 µg/ml Kanamycin and grown
overnight at 37°C.

Evaluation of selected clones by sequencing and Western blot analysis. Selected clones were grown overnight at
37°C in 3 ml LB medium supplemented with 50 µg/ml Kanamycin to prepare plasmid DNA using
standard procedures. Sequencing was performed at MWG (Germany) or in collaboration with TIGR
(U.S.A.).

For Western blot analysis approximately 10 to 20 µg of total cellular protein was separated by 10% SDS-
PAGE and blotted onto HybondC membrane (Amersham Pharmacia Biotech, England). The LamB or
FhuA fusion proteins were detected using human serum as the primary antibody at a dilution of
approximately 1:5,000 and anti-human IgG or IgA antibodies coupled to HRP at a dilution of 1:5,000 as
secondary antibodies. Detection was performed using the ECL detection kit (Amersham Pharmacia
Biotech, England). Alternatively, rabbit anti-FhuA or rabbit anti-LamB polyclonal immune sera were
used as primary antibodies in combination with the respective secondary antibodies coupled to HRP for
the detection of the fusion proteins.

Results

Screening of bacterial surface display libraries by magnetic activated cell sorting (MACS) using biotinylated Igs.
The libraries LSPn70 in pMAL9.1 and LSPn300 in pHIE11 were screened with pools of biotinylated,
human IgGs and IgAs from patient sera or sera from healthy individuals (see Example 1: Preparation of
antibodies from human serum). The selection procedure was performed as described under Experimental
procedures. Figure 3A shows a representative example of a screen with the LSPn-70 library and
PSPn3-IgGs. As can be seen from the colony count after the first selection cycle from MACS screening, the total
number of cells recovered at the end is drastically reduced from 2x 10^7 cells to approximately 5x 10^ cells,
whereas the selection without antibodies added showed a reduction to about 2x 10^ cells (Figure 3A).
After the second round, a similar number of cells was recovered with PSPn3-IgGs, while fewer than 10
cells were recovered when no IgGs from human serum were added, clearly showing that selection was
dependent on S. pneumoniae specific antibodies. To evaluate the performance of the screen, 26 selected
clones were picked randomly and subjected to immunoblot analysis with screening IgG pool (PSPnT)
(Figure 3B). This analysis revealed that -^0% of the selected clones showed reactivity with antibodies
present in the relevant serum, whereas the control strain expressing LamB without an S. pneumoniae
specific insert did not react with the same serum. In general, the rate of reactivity was observed to lie
within the range of 35 to 90%. Colony PCR analysis showed that all selected clones contained an insert in
the expected size range.

Subsequent sequencing of a larger number of randomly picked clones (600 to 1200 per screen) led to the
identification of the gene and the corresponding peptide or protein sequence that was specifically
recognized by the human serum antibodies used for screening. The frequency with which a specific clone
is selected reflects at least in part the abundance and/or affinity of the specific antibodies in the serum
used for selection and recognizing the epitope presented by this clone. In that regard it is striking that
clones derived from some ORFs (e.g. SP2216, SP0117, SP0641, SP2136, SP2190, SP0107, SP0082) were
picked more than 100 times, indicating their highly immunogenic property. Table 1 summarizes the data
obtained for all 12 performed screens. All clones that are presented in Table 1 have been verified by
immunoblot analysis using whole cellular extracts from single clones to show the indicated reactivity
with the pool of human serum used in the respective screen. As can be seen from Table 1, distinct regions
of the identified ORF are identified as immunogenic, since variably sized fragments of the proteins are
displayed on the surface by the platform proteins.

It is further worth noticing that most of the genes identified by the bacterial surface display screen encode
proteins that are either attached to the surface of S. pneumoniae and/or are secreted. This is in accordance
with the expected role of surface attached or secreted proteins in virulence of S. pneumoniae.

Example 4: Assessment of the reactivity of highly immunogenic peptide sequences with individual 
human sera. 

Experimental procedures 

Peptide synthesis 

Peptides were synthesized in small scale (4 mg resin; up to 288 in parallel) using standard Fmoc
chemistry on a Rink amide resin (PepChem, Tübingen, Germany) using a SyroII synthesizer
(Multisyntech, Witten, Germany). After the sequence was assembled, peptides were elongated with
Fmoc-epsilon-aminohexanoic acid (as a linker) and biotin (Sigma, St. Louis, MO; activated like a normal
amino acid). Peptides were cleaved off the resin with 93% TFA, 5% triethylsilane, and 2% water for one
hour. Peptides were dried under vacuum and freeze dried three times from acetonitrile/water (1:1). The
presence of the correct mass was verified by mass spectrometry on a Reflex III MALDI-TOF (Bruker,
Bremen, Germany). The peptides were used without further purification.

Enzyme linked immune assay (ELISA)

Biotin-labeled peptides (at the N-terminus) were coated on Streptavidin ELISA plates (EXICON) at 10
µg/ml concentration according to the manufacturer's instructions. Highly specific Horse Radish
Peroxidase (HRP)-conjugated anti-human IgG secondary antibodies (Southern Biotech) were used
according to the manufacturers' recommendations (dilution: 1,000x). Sera were tested at two serum
dilutions, 200X and 1,000X. Following manual coating, peptide plates were processed and analyzed by
the Gemini 160 ELISA robot (TECAN) with a built-in ELISA reader (GENIOS, TECAN).

Approximately 110 patients and 60 healthy adult sera were included in the analysis. Following the
bioinformatic analysis of selected clones, corresponding peptides were designed and synthesized. In case
of epitopes with more than 26 amino acid residues, overlapping peptides were made. All peptides were
synthesized with an N-terminal biotin-tag and used as coating reagents on Streptavidin-coated ELISA
plates.
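
Splitting an epitope longer than 26 residues into overlapping peptides can be sketched as follows. This is an illustrative sketch only: the 26-residue limit comes from the text, whereas the 10-residue overlap, the function name and the example sequence are assumptions, since the actual overlap used is not stated.

```python
def overlapping_peptides(epitope, max_len=26, overlap=10):
    """Split an epitope into peptides of at most max_len residues.

    Consecutive peptides share `overlap` residues (an assumed value);
    the last peptide is anchored at the C-terminus so that every
    residue of the epitope is covered.
    """
    if max_len <= 0 or not 0 <= overlap < max_len:
        raise ValueError("need max_len > 0 and 0 <= overlap < max_len")
    if len(epitope) <= max_len:
        return [epitope]
    step = max_len - overlap
    peptides = []
    start = 0
    while start + max_len < len(epitope):
        peptides.append(epitope[start:start + max_len])
        start += step
    peptides.append(epitope[-max_len:])  # final window ends at the C-terminus
    return peptides


# Hypothetical 40-residue epitope (not a sequence from this document).
example_epitope = "ACDEFGHIKLMNPQRSTVWY" * 2
example_peptides = overlapping_peptides(example_epitope)
```

Anchoring the last peptide at the C-terminus keeps all peptides at full length, at the cost of a larger overlap between the final two peptides.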

The analysis was performed in two steps. First, peptides were selected based on their reactivity with the
individual sera, which were included in the serum pools used for preparations of IgG and IgA screening
reagents for bacterial surface display. A summary for serum reactivity of 224 peptides representing S.
pneumoniae epitopes from the genomic screen analysed with 20 human sera (representing 4 different
pools of five sera) used for the antigen identification is shown in Table 2. The peptides were compared by
the score calculated for each peptide based on the number of positive sera and the extent of reactivity.
Peptides range from highly and widely reactive to weakly positive ones. Among the most reactive ones
there are known antigens, some of which are also protective in animal challenge models for
nasopharyngeal carriage or sepsis (e.g. PspA/SP0117, serine protease/SP0641, histidine triad
protein/SP1175). Peptides not displaying a positive reaction were not included in further, more detailed
studies.

Second, a large number of not pre-selected individual sera from patients with invasive pneumococcal 
disease or from healthy adults and children were tested against the peptides showing specific and high 
reactivity with the screening sera. Seroconversion during disease was tested for highly positive peptides 
by using three serial serum samples collected longitudinally from patients with invasive pneumococcal 
disease, the first before disease occurred (pre), the second in the acute phase (within 5 days after onset) 
and the third in the convalescent phase (> 3 weeks after onset) of the disease. Two representative ELISA 
experiments are shown with two different patients, displaying seroconversion to multiple peptides, 
suggesting that epitope-specific antibody levels were low before disease occurred, and were induced in
the acute and convalescent phase (Fig. 5). The antigens showing this antibody profile are especially 
valuable for vaccine development (e.g. SP2216, SP2109, SP1175, SP0117, SP0082). 

Example 5: Gene distribution studies with highly immunogenic proteins identified from S. 
pneumoniae. 

Experimental procedures 

Gene distribution of pneumococcal antigens by PCR. An ideal vaccine antigen would be an antigen that is
present in all, or the vast majority of strains of the target organism to which the vaccine is directed. In
order to establish whether the genes encoding the identified Streptococcus pneumoniae antigens occur
ubiquitously in S. pneumoniae strains, PCR was performed on a series of independent S. pneumoniae
isolates with primers specific for the gene of interest. S. pneumoniae isolates were obtained covering the
serotypes most frequently present in patients as shown in Figure 4A. Oligonucleotide sequences as
primers were designed for all identified ORFs yielding products of approximately 1,000 bp, if possible
covering all identified immunogenic epitopes. Genomic DNA of all S. pneumoniae strains was prepared as
described under Example 2. PCR was performed in a reaction volume of 25 µl using Taq polymerase
(1 U), 200 nM dNTPs, 10 pmol of each oligonucleotide and the kit according to the manufacturer's
instructions (Invitrogen, The Netherlands). As standard, 30 cycles (1x: 5 min. 95°C, 30x: 30 sec. 95°C, 30 sec.
SS^'C, 30 sec. 72°C, 1x: 4 min. 72°C) were performed, unless conditions had to be adapted for individual
primer pairs.
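
As a rough planning aid, the programmed hold time of the standard cycling protocol can be added up as sketched below. The step names and the data layout are assumptions made for illustration; only the durations and repeat counts are taken from the protocol, and ramp times between temperatures, which depend on the thermocycler, are not included.

```python
# Standard program as (step name, hold time in seconds, number of repeats).
PCR_PROGRAM = [
    ("initial denaturation", 5 * 60, 1),
    ("denaturation", 30, 30),
    ("annealing", 30, 30),
    ("extension", 30, 30),
    ("final extension", 4 * 60, 1),
]


def total_hold_seconds(program):
    """Sum the hold times of all steps, counting each repeat."""
    return sum(duration * repeats for _name, duration, repeats in program)


total_minutes = total_hold_seconds(PCR_PROGRAM) / 60
```

Under these assumptions the holds alone add up to 54 minutes per run.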

Results 

All identified genes encoding immunogenic proteins were tested by PCR for their presence in 50 different
strains of S. pneumoniae (Figure 4A). As an example, Figure 4B shows the PCR reaction for SP1604 with all
indicated 50 strains. As clearly visible, the gene is present in all strains analysed. The PCR fragment from
a type 14 strain was sequenced and showed that of 414 bp, 6 bp are different as compared to the S.
pneumoniae type 4 strain, resulting in three amino acid differences between the two isolates.
From a total of 50 genes analysed, 31 were present in all strains tested, while 9 genes were absent in more
than 10 of the tested 50 strains (Table 3). Several genes (SP0667, SP0930) showed variation in size and
were not present in all strain isolates. Some genes showed variation in size, but were otherwise conserved
in all tested strains. Sequencing of the generated PCR fragment from one strain and subsequent
comparison to the type 4 strain confirmed the amplification of the correct DNA fragment and revealed a
degree of sequence divergence as indicated in Table 3. Importantly, many of the identified antigens are
well conserved in all strains in sequence and size and are therefore novel vaccine candidates to prevent
infections by pneumococci.
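
The grouping of genes by their PCR presence across strains can be sketched as follows. The function names, category labels and example data are hypothetical and are not taken from Table 3; the cutoff of more than 10 absent strains mirrors the grouping used above.

```python
from collections import Counter


def classify_gene(present_by_strain, many_absent_cutoff=10):
    """Classify one gene from a list of per-strain PCR results (True = product)."""
    absent = sum(1 for present in present_by_strain if not present)
    if absent == 0:
        return "present in all strains"
    if absent > many_absent_cutoff:
        return "absent in more than %d strains" % many_absent_cutoff
    return "absent in a few strains"


def tally(results):
    """Count how many genes fall into each category."""
    return Counter(classify_gene(calls) for calls in results.values())


# Hypothetical PCR calls for three genes across 50 strains.
example_results = {
    "gene_A": [True] * 50,
    "gene_B": [True] * 35 + [False] * 15,
    "gene_C": [True] * 47 + [False] * 3,
}
example_counts = tally(example_results)
```

A negative PCR may also reflect sequence divergence at a primer site, so an "absent" call in such a tally is best read as provisional.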

Example 6: Characterization of immune sera obtained from mice immunized with highly
immunogenic proteins/peptides from S. pneumoniae displayed on the surface of E. coli.

Experimental procedures 

Generation of immune sera from mice 

E. coli clones harboring plasmids encoding the platform protein fused to an S. pneumoniae peptide were
grown in LB medium supplemented with 50 µg/ml Kanamycin at 37°C. Overnight cultures were diluted
1:10, grown until an OD600 of 0.5 and induced with 0.2 mM IPTG for 2 hours. Pelleted bacterial cells were
suspended in PBS buffer and disrupted by sonication on ice, generating a crude cell extract. According to
the OD600 measurement, an aliquot corresponding to 5x 10^ cells was injected into NMRI mice i.v.,
followed by a boost after 2 weeks. Serum was taken 1 week after the second injection. Epitope specific
antibody levels were measured by peptide ELISA.

In vitro expression of antigens 

Expression of antigens by in vitro grown S. pneumoniae serotype 4 was tested by immunoblotting. 
Different growth media and culture conditions were tested to detect the presence of antigens in total 
lysates and bacterial culture supernatants. Expression was considered confirmed when a specific band
corresponding to the predicted molecular weight and electrophoretic mobility was detected.

Cell surface staining 

Flow cytometric analysis was carried out as follows. Bacteria were grown under culture conditions
which resulted in expression of the antigen as shown by the immunoblot analysis. Cells were washed
twice in Hanks Balanced Salt Solution (HBSS) and the cell density was adjusted to approximately 1 X 10^
CFU in 100 µl HBSS, 0.5% BSA. After incubation for 30 to 60 min at 4°C with mouse antisera diluted 50 to
100-fold, unbound antibodies were washed away by centrifugation in excess HBSS, 0.5% BSA. Secondary
goat anti-mouse antibody (F(ab')2 fragment specific) labeled with fluorescein (FITC) was incubated with
the cells at 4°C for 30 to 60 min. After washing, cells were fixed with 2% paraformaldehyde. Bound
antibodies were detected using a Becton Dickinson FACScan flow cytometer and data further analyzed
with the computer program CELLQuest. Negative control sera included mouse pre-immune serum and
mouse polyclonal serum generated with lysates prepared from IPTG induced E. coli cells transformed
with plasmids encoding the genes lamB or fhuA without S. pneumoniae genomic insert.

Bactericidal (killing) assay 

Murine macrophage cells (RAW264.7 or P388.D1) and bacteria were incubated and the loss of viable
bacteria after 60 min was determined by colony counting. In brief, bacteria were washed twice in Hanks
Balanced Salt Solution (HBSS) and the cell density was adjusted to approximately 1X 10^ CFU in 50 µl
HBSS. Bacteria were incubated with mouse sera (up to 25%) and guinea pig complement (up to 5%) in a
total volume of 100 µl for 60 min at 4°C. Pre-opsonized bacteria were mixed with macrophages (murine
cell line RAW264.7 or P388.D1; 2X 10^ cells per 100 µl) at a 1:20 ratio and were incubated at 37°C on a
rotating shaker at 500 rpm. An aliquot of each sample was diluted in sterile water and incubated for 5
min at room temperature to lyse macrophages. Serial dilutions were then plated onto Todd-Hewitt Broth
agar plates. The plates were incubated overnight at 37°C, and the colonies were counted with the
Countermat flash colony counter (IUL Instruments). Control sera included mouse pre-immune serum
and mouse polyclonal serum generated with lysates prepared from IPTG induced E. coli transformed
with plasmids harboring the genes lamB or fhuA without S. pneumoniae genomic insert.
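
The loss of viable bacteria determined by colony counting can be expressed as a percentage, for example as sketched below. The formula, the function names and the example counts are assumptions made for illustration; the exact calculation used for the data in this Example is not stated.

```python
def percent_killing(cfu_start, cfu_end):
    """Percent loss of viable bacteria between the start and end counts.

    A negative value means the bacteria grew during the incubation.
    """
    if cfu_start <= 0:
        raise ValueError("cfu_start must be positive")
    return 100.0 * (cfu_start - cfu_end) / cfu_start


def specific_killing(immune_counts, control_counts):
    """Killing with immune serum minus killing with control serum.

    Each argument is a (cfu_start, cfu_end) pair.
    """
    return percent_killing(*immune_counts) - percent_killing(*control_counts)


# Hypothetical colony counts: immune serum versus pre-immune control serum.
example_immune = (1000, 250)
example_control = (1000, 900)
example_difference = specific_killing(example_immune, example_control)
```

Subtracting the control value separates antibody-dependent killing from the background loss of bacteria seen with pre-immune or platform-only sera.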

Results 

In vitro expression of antigens. The expression of the antigenic proteins was analyzed in vitro in S.
pneumoniae serotype 4 by using sera raised against E. coli clones harboring plasmids encoding the
platform protein fused to an S. pneumoniae peptide. First, the presence of specific antibodies was
determined by peptide ELISA and/or immunoblotting using the E. coli clone expressing the given epitope
embedded in LamB or FhuA platform proteins. Positive sera were then analysed by immunoblotting using
total bacterial lysates and culture supernatants prepared from the S. pneumoniae serotype 4 strain (data not
shown). This analysis served as a first step to determine whether a protein is expressed at all and, if so,
under which growth conditions, in order to evaluate surface expression of the polypeptide by FACS
analysis. It was anticipated based on literature data that not all proteins would be expressed under in
vitro conditions.

Cell surface staining of S. pneumoniae. Cell surface accessibility for several antigenic proteins was
subsequently demonstrated by an assay based on flow cytometry. Streptococci were incubated with
preimmune and polyclonal mouse sera raised against S. pneumoniae lysate or E. coli clones harboring
plasmids encoding the platform protein fused to an S. pneumoniae peptide, followed by detection with
fluorescently tagged secondary antibody. As shown in Fig. 6A, antisera raised against S. pneumoniae
lysate contain antibodies against surface components, demonstrated by a significant shift in fluorescence
of the S. pneumoniae serotype 4 cell population. Similar cell surface staining of S. pneumoniae serotype 4
cells was observed with polyclonal sera raised against peptides of many of the pneumococcal antigens
identified (Fig. 6B and Table 4). In some instances, a subpopulation of the bacteria was not stained, as
indicated by the detection of two peaks in the histograms (Fig. 6B). This phenomenon may be a result of
differential expression of the gene products during the growth of the bacterium, insufficient antibody
levels or partial inhibition of antibody binding caused by other surface molecules or plasma proteins.

In vitro bactericidal activity. Opsonophagocytic killing is the cornerstone of host defense against
extracellular bacteria, such as S. pneumoniae. Antibodies that bind to bacterial antigens on the cell surface are
opsonizing and induce killing (bactericidal) by phagocytic cells (macrophages and neutrophil
granulocytes) if the antibodies induced by the particular antigens can bind activated complement
components (C3bi). It has been shown that anti-pneumococcal bactericidal activity of human sera
measured in in vitro assays can be correlated with in vivo protection of vaccinated individuals {Romero-
Steiner, S. et al., 1999}. In Figure 7 examples are shown and in Table 4 a summary is presented on
bactericidal activity measured by antigen-specific antibodies generated in mice with corresponding
epitopes. According to these data, several of the novel pneumococcal antigens induce functional
antibodies (e.g. SP0082, SP2216, SP2136, SP0454, SP0069, SP0369, etc.). Importantly, a well-known
protective S. pneumoniae antigen, PspA (SP0117), proved to be strongly positive in the very same assay.

These experiments confirmed the bioinformatic prediction that many of the proteins are exported due to
their signal peptide sequence and in addition showed that they are present on the cell surface of S.
pneumoniae serotype 4. They also confirm that these proteins are available for recognition by human
antibodies with functional properties and make them valuable candidates for the development of a
vaccine against pneumococcal diseases.

Example 7: Identification of pneumococcal antigens inducing protective immune responses 

Experimental procedures 

Expression of recombinant pneumococcal proteins

Cloning of genes/DNA fragments: The gene/DNA fragment of interest was amplified from the genomic
DNA of S. pneumoniae (strain T4, capsular type 4) by PCR using gene-specific primers. Apart from the
gene-specific part, the primers had restriction sites that aided directional cloning of the amplified
PCR product. The gene-annealing (specific) part of the primer ranged between 15 and 24 bases in length.
The PCR products obtained were digested with the appropriate restriction enzyme and cloned into the
pET28b(+) vector (NOVAGEN). Once the recombinant plasmid was confirmed to contain the gene of interest,
E. coli BL21 star® cells (INVITROGEN), which served as expression hosts, were transformed. These cells
were optimized to efficiently express the gene of interest.

Expression and purification of proteins: E. coli BL21 star® cells harbouring the recombinant plasmid
were grown until log phase in the required culture volume. Once an OD600nm of 0.8 was reached, the
culture was induced with 1 mM IPTG for 3 hours at 37°C. The cells were harvested by centrifugation and
lysed by a combination of freeze-thawing followed by disruption of the cells with 'Bug-buster®'
(NOVAGEN). The lysate was separated by centrifugation into soluble (supernatant) and insoluble (pellet)
fractions. Depending on the location of the protein, different purification strategies were followed. If
the protein was in the soluble fraction, it was purified by binding the supernatant to Ni-agarose beads
(Ni-NTA-Agarose®, QIAGEN). Owing to the penta-histidine (HIS) tag at the C or N or both termini of the
expressed protein, it bound to the Ni-agarose while the other, contaminating proteins were washed from
the column with wash buffer. The proteins were eluted with 100 mM imidazole, and the eluate was
concentrated, assayed by the Bradford method for protein concentration and checked by PAGE and Western
blot. If the protein was present in the insoluble fraction, the pellet was solubilized in buffer
containing 8 M urea. The purification was done under denaturing conditions (in buffer containing 8 M
urea) using the same materials and procedure as mentioned above. The eluate was concentrated and
dialyzed in a gradual stepwise manner to remove all the urea. The proteins were checked by SDS-PAGE and
concentrations were measured by the Bradford method.
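
The primer layout described under "Cloning of genes/DNA fragments" (a gene-specific annealing part of 15
to 24 bases preceded by a restriction site for directional cloning) can be sketched as follows. This is
an illustration only: the choice of NdeI and XhoI, the four-base 5' clamp, the function names and the
toy open reading frame are assumptions, since the text does not state which enzymes or primer sequences
were used. Reading-frame and tag considerations are deliberately ignored.

```python
# Illustrative sketch only: enzyme choice, clamp and the toy ORF are assumptions,
# not the primers actually used in the experiments described in the text.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Recognition sequences written 5'->3' (both are palindromic).
NDEI = "CATATG"
XHOI = "CTCGAG"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence made of A/C/G/T."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(orf, fwd_site=NDEI, rev_site=XHOI, anneal_len=20, clamp="GCGC"):
    """Build (forward, reverse) primers as clamp + restriction site + annealing part.

    anneal_len is restricted to the 15-24 base range quoted in the text.
    """
    if not 15 <= anneal_len <= 24:
        raise ValueError("annealing part must be 15-24 bases long")
    orf = orf.upper()
    if len(orf) < anneal_len:
        raise ValueError("ORF is shorter than the requested annealing part")
    # Forward primer anneals to the start of the ORF (same strand as the ORF).
    forward = clamp + fwd_site + orf[:anneal_len]
    # Reverse primer anneals to the end of the ORF (opposite strand).
    reverse = clamp + rev_site + reverse_complement(orf[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    toy_orf = "ATG" + "GCTGAAACCATTGGTCGT" * 3 + "TAA"  # hypothetical sequence
    fwd, rev = design_primers(toy_orf, anneal_len=18)
    print("forward:", fwd)
    print("reverse:", rev)
```

Using two different sites on the two primers is what makes the cloning directional: the digested PCR
product can only ligate into the vector in one orientation.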

Animal protection studies 

Animals: C3H (HeNHsd; A, B, C, D: agouti, wild type, inbred) female mice were used.

Active immunization: 50 µg of recombinant protein was injected subcutaneously, adjuvanted with
Complete Freund Adjuvant (CFA). Animals were boosted twice with the same amount of protein, but
adjuvanted with Incomplete Freund Adjuvant (IFA), at days 14 and 28. A well-known protective antigen,
PspA (SP0117), was used as a positive control, while nonimmunized (PBS or CFA/IFA adjuvant injected)
mice served as negative controls. Antibody titers were measured at days 35-38 by ELISA using the
respective recombinant proteins, and were determined to be in the range of 200.00-1.000.000 (end-point
titer).

Passive immunization: Naive mice were injected intraperitoneally with 150-300 µl mouse sera 2 hrs
before intraperitoneal inoculation with S. pneumoniae.

Bacterial challenge: A frozen glycerol stock of S. pneumoniae serotype 6B was prepared and used for all
experiments. The approximate cell number was estimated by OD600nm measurements. In order to determine
the actual viable cell numbers present in the inoculum prepared from the frozen glycerol stock cultures,
cfus were determined via plating using six serial dilutions. 10*-10» bacteria/mouse were injected either
intravenously in the tail vein or intraperitoneally. The protective effect of immunizations was measured
by monitoring survival rates for 2 to 3 weeks post-challenge and was expressed in % of the total number
of animals (10/group).
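
The two calculations implied by the challenge protocol, the viable count obtained from plating a serial
dilution and the survival rate expressed as a percentage of the group, can be sketched as follows. The
function names, the plated volume and the example colony count are hypothetical; only the group size of
10 animals is taken from the text.

```python
# Minimal sketch; the numbers in the example are invented for illustration.

def cfu_per_ml(colonies: int, dilution_factor: float, plated_volume_ml: float) -> float:
    """Viable count of the undiluted inoculum from one countable plate.

    dilution_factor is the total fold dilution of the plated sample
    (e.g. 1e5 for a 10^-5 dilution).
    """
    if colonies < 0 or dilution_factor <= 0 or plated_volume_ml <= 0:
        raise ValueError("counts must be non-negative, dilution and volume positive")
    return colonies * dilution_factor / plated_volume_ml


def survival_percent(survivors: int, group_size: int = 10) -> float:
    """Survival expressed in % of the total number of animals in the group."""
    if group_size <= 0 or not 0 <= survivors <= group_size:
        raise ValueError("survivors must lie between 0 and group_size")
    return 100.0 * survivors / group_size


if __name__ == "__main__":
    # Hypothetical plate: 42 colonies from 0.1 ml of a 10^-5 dilution.
    print(f"{cfu_per_ml(42, 1e5, 0.1):.2e} cfu/ml")
    # Hypothetical outcome: 7 of 10 immunized mice alive at the end of monitoring.
    print(f"{survival_percent(7):.0f}% survival")
```

Plating several dilutions, as in the protocol, ensures that at least one plate falls in a countable
range; the calculation is then applied to that plate.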



Results

In the present invention six different pneumococcal antigens identified by bacterial surface display were
determined to have a protective effect in a mouse sepsis/lethality model. The best levels of protection
were achieved by immunization with recombinant antigens representing the SP2190, SP2216 and SP0667
proteins, while SP0368, SP1732 and SP0498 displayed lower levels of protection (Fig. 8). The protective
effect was mediated by antibodies, as demonstrated by passive serum transfer experiments (Fig. 9). Naive
mice receiving specific anti-SP2190 and anti-SP2216 antibodies were protected from death relative to
mice from the negative control groups. Importantly, the combination of these antigens resulted in
improved protection, as shown in Fig. 9. Passive immunization with 150 µl immune serum generated either
with recombinant SP2190 or with recombinant SP2216 (supplemented with 150 µl naive serum) resulted in a
lower survival rate compared to serum therapy with 100 µl each of the specific antisera (supplemented
with 100 µl naive serum). These experiments strongly support that a combination of these antigens has
beneficial effects in vaccination against pneumococcal diseases.

Since the antigens used for immunization were derived from a serotype 4 strain and the challenge strain
was a serotype 6B, these experiments established that the antigens were cross-protective.

The SP2216, SP2190 and SP1732 recombinant proteins detected the highest levels of antibodies in sera of
patients convalescing from invasive pneumococcal diseases, as well as in those of healthy individuals
exposed to Pneumococcus (children in the household) (data not shown). The most frequently identified
antigen in bacterial surface display screens was the SP2216 protein. It was of special interest to
compare the protectivity of the subdomains of this protein that were selected (N-terminal amino acid
sequences) or not selected (C-terminal amino acid sequences) by human antibodies (Fig. 10A). Upon
immunization with the two different domains (expressed as recombinant antigens) it became evident that
the immunogenic part of the SP2216 protein carried the protective potential, while the non-selected
domain was ineffective and comparable to the negative control (Fig. 10B). Based on this experiment,
bacterial surface display identifies protective epitopes and regions of bacterial proteins, and this
information can be used for the rational design of subunit vaccines based on the antigens described in
the present invention.

Example 8: Determination of sequence conservation of protective antigens

Experimental procedures 

Immunoblotting 

Total bacterial lysate and culture supernatant samples were prepared from in vitro grown S. pneumoniae
strains. Sixty clinical isolates representing 48 different serotypes were included in the study.
Approximately 25 µg total protein/lane was separated by SDS-PAGE using the BioRad Mini-Protean 3
Cell electrophoresis system, and the proteins were transferred to nitrocellulose membrane (ECL, Amersham
Pharmacia). After overnight blocking in 5% milk, hyperimmune mouse sera generated by immunization
with the recombinant proteins SP2216, SP1732 and SP2190 (and SP0117/PspA as internal control) derived
from the serotype 4 strain were added at 5,000x dilution, and HRPO-labeled anti-mouse IgG was used for
detection.



DNA sequencing 
The genes of SP1732, SP2190 and SP2216 were amplified from the genomic DNA of S. pneumoniae
(serotypes 4, 6B, 9V, 14, 18C, 19F and 23F) by PCR using the proofreading polymerase Expand (ROCHE).
Gene-specific primers, ranging between 27 and 31 bases in length, were used to amplify the entire open
reading frames. The PCR products obtained were cloned into the pCR®2.1-TOPO vector (Invitrogen). The
recombinant plasmid DNA was purified using a QIAprep® miniprep kit (Qiagen) before the sequence
was confirmed (MWG). In addition to the seven serotypes, the gene of SP2216 from 41 other
serotypes was amplified by PCR and the purified PCR products were sequenced.

Results 

Identification of conserved antigens inducing antibodies that are cross-reactive with different clinical
isolates is crucial for the development of effective vaccines. It is especially relevant for
protein-based vaccines targeting pneumococcal diseases, since more than 90 different serotypes of
Streptococcus pneumoniae (Pneumococcus) have been associated with human infections.

In a thorough analysis it was determined that the antibodies induced by SP2216, SP2190 and SP1732, all
derived from a serotype 4 strain, broadly cross-reacted with all the different serotypes tested in the
immunoblot analysis (Fig. 11). Notably, the SP2190 antigen, which showed variation in electrophoretic
mobility (indicating different sizes), preserved the antibody reactivity, strongly suggesting that
immunodominant epitopes are conserved. In contrast, we detected lower cross-reactivity with anti-PspA
antibodies, which is in accordance with the known differences in immunogenic amino acid sequences of
this antigen.

In order to directly address the question of whether the identified protective antigens are conserved
among the different serotypes of S. pneumoniae, DNA sequence analysis was performed on the SP2216,
SP1732 and SP2190 genes. SP2216 and SP1732 are highly conserved; only a few amino acid changes were
detected. The SP2216 gene was sequenced from 47 different clinical isolates representing 47 different
S. pneumoniae serotypes, and only single amino acid exchanges were detected, and only in 2 of the
analysed strains (Fig. 12). In the SP1732 gene one or two amino acid exchanges were detected in the
majority (in four of the six) of strains analysed (Fig. 13). The SP2190 antigen showed great variability
in the amino acid sequences encoded by the corresponding genes, as shown in Fig. 14. The insertions and
deletions make it difficult to calculate an exact amino acid homology among the different SP2190
variants, but it can be estimated to be between approx. 60 and 90%. However, the amino acid identity was
sufficient to induce cross-reactive and cross-protective antibodies, based on the experiments presented
in Fig. 8, 9 and 11.
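
Why insertions and deletions make an exact homology value hard to state can be illustrated with a short
sketch: for the same pair of aligned sequences, the percent identity depends on whether gap columns are
counted in the denominator. The function name and the toy sequences are hypothetical, and the sketch
assumes the two sequences have already been aligned (gaps written as "-"); it is not the method used to
obtain the 60-90% estimate in the text.

```python
# Minimal sketch: percent identity of two pre-aligned sequences, with two
# common conventions for handling gap columns. Toy data only.

def percent_identity(aligned_a: str, aligned_b: str, count_gaps: bool = True) -> float:
    """Percent identical positions between two aligned sequences of equal length.

    count_gaps=True  : columns with a gap in one sequence count as mismatches.
    count_gaps=False : such columns are excluded from the comparison.
    Columns where both sequences have a gap are always ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    columns = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue
        if x == "-" or y == "-":
            if count_gaps:
                columns += 1
            continue
        columns += 1
        if x == y:
            matches += 1
    if columns == 0:
        raise ValueError("no comparable columns in the alignment")
    return 100.0 * matches / columns


if __name__ == "__main__":
    a = "MKT-LLA"  # hypothetical aligned fragments
    b = "MKTALVA"
    print(f"gaps counted:  {percent_identity(a, b):.1f}%")
    print(f"gaps excluded: {percent_identity(a, b, count_gaps=False):.1f}%")
```

With many indels, as reported for the SP2190 variants, the two conventions diverge further, which is
consistent with the text giving a range rather than a single figure.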
Table 1: Immunogenic proteins identified by bacterial surface display.



S. pneumoniae
antigenic
protein


Putative function
(by homology)


predicted immunogenic aa**


No. of
selected
clones per
ORF and
screen


Location of
identified
immunogenic
region (aa)


Seq.
ID
(DNA,
Prot.)

SP0008 


hypothetical protein


1-11,35-64^76,101-108411-119 


S:15 


57-114 


1,145 


SP0032


DNA polymerase I 
(polA) 


5-27,32-64,92-102,107-113419-125,133-139448- 

l62,177-187,195-201,207-214>241-251,254-269,285- 

300^-^9,317-324332-357,365-404,411-425443- ^ 

463470-477,479-487,506^12r515-520,532-547,556- 

596,603-610,616-622^24^29,636-642,646-665,667- 

674^7-692,708-720,734-739,752-757,798-820,824- 

851,856-865 


EJ:39,I;6, 
U2 


732-763 


2,146 


SP0069 


Choline binding 
protein I 


14-21,36-44,49-66,102-127462-167477-196 


G:1,H:2, 
E:1,K:44, 
L:3,M:1 


45-109 
145-172 


3,147 


SP0071


immunoglobulin A1
protease (iga-1)


17^,64-75^1-92,100-119,125-172,174-183,214- 

222,230-236,273-28^7-303,310-315,331-340,392- 

398412-42048aa05,51S-523,525-546^575,592- 

598,603-609,617-625,631-639,644-651,658-670,681- 

687,691-704,709-716,731-736,739-744,750-763,774- 

780,784-791,799-805,809^22,859-870,880-885,907- 

916,924-941,943-949,973-9864010-1016,1026- 

1036,1045-1054,1057-1062^1082-10884095-1102,1109- 

11204127-1134,1140.11464152-1159,1169-1179,1187- 

1196,1243-1251,1262-1273,1279-1292,1306-1312,1332- 

1343,1348-1364,1379-1390,1412-1420,1427-1436,1458- 

14684483-15034524-15494574-1588,1614-16194672- 

16854697-1707,1711-1720,1738-1753,1781-17874796- 

1801,1826-1843 


Ai3, Cil, 

D:9,B.-9, 

F:4,G:21, 

fc34K:61, 

L:20,M:2 


132-478 
508-592 
1753-1810 


4,148 


SP0082


Cell wall surface
anchor


15-43,49-55,71-77,104-110,123-130,162-171,180- 

192,199-205,219-227,246-254,264-270,279-287,293- 

308,312-322,330-3fi349-356,369-a77,384-394^1- 

406,416-422,432-439450-460,464474482-494,501- 

508,52W29,53fr546,553-558,568-S74,584-591,602- 

612,616-626,634-646,653-660,673-^1,688-698,705- 

710,720-726,736-749333-848 


C:9,E:4, 
F:2,I:26, 
U4>M-67 


1-199 
200-337 
418-^ 
549-647 


5,149 


SP0107 


LysM domain protein 


9-30,65-96,99-123,170-178 


A:3,B:16, 
C:15,D:1 

F :178, 


1-128 


6,150 








M:l 






SP0117 


pneumococcal surface 
protein A (pspA) 


7n3234-41,96-106,127-136,154rl63488-199;j07- 
238^72''279.30«1^318-325^1-347353^^7- 
393^99-406,434-440,452-503^75^^9^1^615- 
S20,635-640,654-660,674-680,696-7Dl,7ia-731 


A:13,B:11, 

C:10,D:4. 

E:31,F:6, 

G:33, 

H:13,I:9, 

B064L:32, 

M:46 


1-548 
660-691 


7,151 


SP0191 


hypothetical protein 


4-19,35-4448-59,77-87,93-99406-111,130-138,146-161 


E:1,I:2 


78-84 


8,152 


SP0197 


dihydrofolate 
synthetase, putative 


24-3036^,64-86,93-99,106-130,132-145,148-165,171- 

177,189-220,23&*249,251-263,293^0>3Q2-312323- 

329,338-356;369-379390-412 


L:9 


179-193 


9, 153 


SP0212


Ribosomal protein L2


30-39,61-67,74-81,90-120,123-145,154rl67,169-179,182- 
197,200-206,238-244,267-272 


L:10 


230-265 


10,154 


SP0222


Ribosomal protein S14 


14-20,49-65,77-86 


H:14, L:8, 
M:3 


2-68 


11,155 


SP0239


Conserved
hypothetical protein


^9,26-35^-48,53-61,63-85,90-101,105-111413' 

121,129-137,140-150,179-188,199-226,228-237,248- 
255,259-28539^8314-331337-343353-364410- 
421436-442 




110-144 


12,156 


SP0251


formate
acetyltransferase,
putative


36-47,55-63,94-108,129-134^144-158,173-187,196- 

206,209-238^1-266,270-285,290-295300^306333. 

344346-354366^97,404-410,422-435,439-453,466- 

473315^523329-543354-569^71-585390-596,607- 

618,627-643,690^96,704-714,720-^741-749,752- 

767,780-799 


Gw2,H:7, 


225-247 
480-507 


13,157 


SP0295


ribosomal protein S9
(rpsI)


16-253^70,80-93,100-106 


1:4 


78-130 


14, 158 


SP0330


sugar binding
transcriptional
regulator RegR


18-27,41-4630-57,65-71,79-85,93-98,113-128,144r 

155,166-178,181-188,201-207,242-26^,265-273,281- 

295303-309318-327 


G:l, H:l^ 
L:4 


36-64 


15,159 


SP0368


cell wall surface
anchor family protein


7-2931-4430^9,91-96,146-153,194-201,207-21^232- 

238;264-278,284-290,296^30?326-35336O6703^ 

384,400-405409-418,420-435,442-460,499-506329- 

534356-562364-576,644-651^77-684,687-698,736- 

743,759-766,778-784308-814352-858374-896,920- 

925,929-935,957-965,1003-1012^1021-1027,1030- 

1044rl081-1087,1101-llll,1116-1124rll48-1159,1188- 


D:1,H:3, 

I:1,L:1, 

M:3 


1-70 

154-189 

922-941 

1445-1462 

1483-1496 


16,160 






1196,1235"1251,1288-1303,1313-1319A328-1335^1367- 
1373,1431-1437,1451-1458,1479-1503,1514-1521,1530- 
1540,1545-1552,1561-1568,1598-1605,1617-1647,1658- 
1665,1670-1676,1679-1689,1698-1704,1707-1713,1732- 
1738A744rl764 








SP0369


Penicillin binding
protein 1A


6^t81-9U04-113,126-137,150-159,164-174,197- 
209,215-224,229'235;256-269,276-2a2,307-313,317- 
348,351-357,376^97,418-437,454-464,485-490,498- 
109,547-555,574^6,602-619 


B:l, E:l, 
L:13,M:1 


452^530 


17, 161 


SP0374


hypothetical protein


25^1,39^/^56,99-114;,121-127,159-186,228-240,253- 
2e9,271.279,303^15,365-38^395-405>414425,438^ 


B:4,I-1, 
L:3 


289-384 


18, 162 


SP0377 


Choline binding 
protein C 


9-24,41-47,49-54,68-78,108-114>117-122,132-140,164- 

169,179-186493-199^06-213,244-251,267-274,289- 

294,309014^7^ 


G.5,H:4, 
r:l,K:88, 
L:3,M:8 


209-249 
286-336 


19,163 


SP0378 


choline binding
protein J (cbpJ)


9-28,53-67,69-8237-93,109.117,172.177,201-207,220. 
227,242-247,262-268,305^18,320-325 


EC:47,L:6, 
M:S 


286-306 


20,164 


SP0390


choline binding 
protein G (cbpG) 


4-10,26-39,47-58,63-73,86-96,98-108,115-123,137- 

143,148-155460-176,184-189,194-204,235-240,254- 

259,272-278 


G:l, K:69, 
M:6 


199-283 


21,165 


SP0454 


hypothetical protein


ir26,33-39,47«,59-65,76-83,9l-97,l04-112,118- 

137,155-160,167-174,198-207,242.268,273-279,292- 

315,320^2,345-354,358-367,377-394403-410,424- 

439,445-451,453-497,511^18,535-570,573^9,592- 

601,604-610 


H:1,I:1, 
L.'6 


202-2'^ 


22,166 


SP0463


cell wall surface
anchor family protein


8^,36^,64-71,76-82,97-103,105-112,134-151,161- 

183,211-234r253-268,270-276,278-284,297-d05,309. 

315,357^2,366-372,375-384,401-407,409-416,441- 

455,463-470,475-480,490-497,501-513,524-537,552- 

559,565-576,581^590,592-600,619-625,636-644,646-656 


A:l,Bt2, 
G4,E:1, 
F:4. 


n6-419 


23,167 


SP0466


sortase, putative


4-17,52-58,8499,102-110,114.120,124-135,143-158,160- 
173,177-196,201-216,223-250,259-267,269-275 


E:1,M:2 


1-67 


24,168 


SP0468


Sortase, putative


6-46,57-67,69-80,82-133,137-143,147-168,182.187,2(^ 
209,214-229,233-242,246-280 


G:24, 
BI:20, L:l 


53-93 


25,169 


SP0498


endo-beta-N-acetylglucosaminidase,
putative


7-40,50-56,81^9,117-123,202-209,213-218,223-229,248- 

261,264r276,281-288,303-a08,313-324326^33^34D. 

346,353-d7V34.443,465-474>514623,556-564,605- 

616,620^,631-636,667-683,685.699,710-719,726- 

732,751-756,760-771,779-788,815-828,855-867^- 


B.5,C1, 
E:^F:1, 
G:2 


1226-1309 
1455-1536 
1538-1605 


26, 170 






B79397^02,917-924>926-931,936-94^981-10004006- 

10154017-10284030-10394046-1054,1060-1066,1083- 

10924099-111^1122-11304132-11404148-11584161- 

11714174-1181,1209-1230,1236-1244,1248-1254,1256- 

1267,1269-1276,1294-1299,1316-13284332-1354,1359- 

1372,1374-1380,1384-1390,1395-1408,1419-1425,1434- 

1446,1453-14604465-1471,1474-14934505-1515,1523- 

1537,1547-15554560-15674577-16054633-1651 








SP0509


type I restriction-
modification system,
M subunit


4-10,31^9,81^,106-112,122-135,152-158,177-184,191- 
197,221-227,230-246,249-255,303511,317^26,337- 

344,346-362365-371,430-437,439-446/453-462,474^ 


1:2 


449-467 


27,171 


SP0519 


dnaJ protein (dnaJ)


9-15,24^,47-55,122-128,160-177,188-196,202-208,216- 

228;25O-261,272-303,318-324327-339,346-352355- 

361,36M73 


A:l,l>:2, 


108-218 
344576 


28,172 


SP0529


BlpC ABC transporter
(blpB)


S-1417-48,55-63,71-90,99-109416-124,181-189,212- 

223,232-268,270-294r297-504319-325,340-348,351- 

370,372-578,388^94,406-415,421-434 


A:1,B:3, 
C3,D:1, 
F:4> 


177-277 


29,173 


SP0564 


hypothetical protein 


21^9,42-61,65-75,79-85,108-115 


H:3 


11-38 


30, 174 


SP0609


amino acid ABC
transporter, amino
acid-binding protein


4-17,26^9,61-76,103-113,115-122,136-142,158-192,197- 
203,208-214U25-230,237-251 


1:3 


207-225 


31, 175 


SP0613 


metallo-beta-
lactamase superfamily
protein


5-n,27^,42-53^-70,74-93,95-104,114rll9,127- 

150,153-159473-179,184-193,199-206,222-241,248- 

253,257-280,289-295,313-319,322-342:,349-365,368- 

389,393-406,408-413,426-438,447-461,463-470,476- 

495,532-537,543-550 




225-246 


32,176


SP0641


Serine protease 


«9,68^123-130,141-147,149-157,178-191,203- 

215,269-277,300-307,327^,359-370,374580382- 

388393-400,410-417,434-442,483-492,497-503305- 

513333^364-569,601-^,639-647,655^66,693- 

^6,712-718,726-736,752-758,763-771,774-780,786- 

799306-812320^28352-«63384-«92,901-909,925- 

932,943-948,990-996,1030-1036,1(61-1059,1062- 

1068,1079-1086,1105-1113,1152-1162,1168-1179,1183- 

1191,1204.1210,1234-1244>1286-1295,1318-1326,1396- 

1401,1451-1460,1465-1474,1477-1483,1488-1494,1505- 

1510,1514-1521,1552-1565,1593-1614,1664-1672,1677- 

1685,1701-1711,1734-1745,1758-1770,1784-1798,1840- 


A:19,B:72, 

C34,D:5, 
E:21, F:86, 
G:26, 

EI:86,I:17, 

L:130, 

M:29 


1-548 

373-490 

573-767 

903-1043 

1155-1198 

1243-1482 

1550-1595 

1682-1719 

1793-1921 

2008-2110 


33,177 






fl847,1852-1873,1885-18914906-191U931-1939,1957- 
I970.1977-1992,2014-2020;2026-2032^116-2134 








SP0648 


beta-galactosidase 
(bgaA) 


l(W5^9-52,107-11^181-188,226-236,238-253,25S- 

268,275-284;296^10^26-338«345^^380-389^91- 

m4ia418y420-429,444-456^489^^73nS88^16. 

&23,637-643,726-739,741-767,785-791,793-803,830- 

847^7-881,886-92^949-956,961-980,988-1004>1009- 

10184027-1042,105M069,1076-X0894108-11154123- 

1135,1140-1151,1164-1179,11»-1191,1210-1221,1223- 

1234,1242-1250,1255-12674281-1292^1301-1307,1315- 

134M348-13K,1366-13734381-1413,1417-1428,1437- 

1444.1453-1463,1478-1484,1490-14964498-1503,1520- 

1536,1538-1546,1548-1570,1593-1603,1612-1625,1635- 

1649,1654-1660,1670-1687,1693-1700,1705-1711,1718- 

1726,1729-1763,1790-1813,1871-1881,1893-1900,1907- 

1935,1962-1970,1992-2000,2006-2013,2033-2039,2045- 

2051,2055-2067,2070-2095,2097-2110,2115-2121,2150- 

2171,2174-2180,2197-220^,2206-2228 


C:l, E:l, 
F:1,G:1, 
H:4,l:l, 
Nl:2 


1526-1560 


34,178 


SP0664 


Zinc metalloprotease
ZmpB, putative


4-17,35-48,54-76,78-107,109-115,118-127,134-140,145- 

156,169'174;^17-226,232-240,256-262,267-273^16- 

328,340-346,353-360>402-409,416-439,448-456,506- 

531,540^546,570-578,586-593,595-600,623-632,662- 

667,674-681,689-705,713-724,730-740,757-763,773- 

778,783-796,829-835,861-«71,88d-899,907-939,941- 

955,957-969,986-1000,1022-1028,1036-1044,1068- 

1084,1095-11024118-1124,1140-1146,1148-1154,1168- 

1181,1185-11904197-1207,1218-1226,1250-1270,1272- 

1281,1284-1296,1312-1319,1351-1358,1383-1409,1422- 

1428,1438-1447,1449-1461,1482-1489,1504-1510,1518- 

15274529-1537,1544rlS51,1569-1575,1622-1628,1631- 

1637,1682-1689,1711-1718,1733-1740,1772-1783,1818- 

1834,1859-1872 


A:9,Ba5, 
C:13,D:7, 
6:14, P:77, 
G:12, 
H:10, 

K:67,L:13, 
M:6 


1-64 
128-^ 


35,179 


SP0667


pneumococcal surface
protein, putative


8-28,32-37,62-69,119-125,137-149,159-164,173-189,200- 

2(B,221-229,240-245,258-265,268-276,287-293;296- 

302,323-329 


A;72,B;80, 
C-90,X>^, 
E:1^F:53 


1-95 


36,180 


SP0688


UDP-N-acetylmuramoylalanine-
D-glutamate ligase


9-18,25^,49-63,65-72,74-81,94-117,131-137,139- 

146,149-158,162-188,191-207,217-225,237.252,255- 

269,281-293,301^26,332-342347-354363-370,373. 


1:3 


75-107 


37, 181 



WO 2004/092209 PCT/EP2004/003984









^;391-400,415-424>441-447 








SP0749 


branched-chain amino
acid ABC transporter


4-24,64-71,81^7,96-116,121-128,130-139,148-155,166- 

173,176-184,203-215,231-238,243-248;256-261,280- 

286,288^314^9 


E:2,L8, 
L:8 


67-148 


38,182 


SP0770


ABC transporter, 
ATP-binding protein 


^10,19-37/16-52,62-81,83-89,115-120434-139,141- 
151,168-186,197-205,209-234^241-252,322-335,339- 
345363^79,385^393,403-431,434-442:447-454>459- 
465479-484^-496 


U2 


404-420 


39,183 


SP0785 


conserved
hypothetical protein


10-35,46-66,71-77,84-93,96-122,138-148,154-172,182- 

213,221-233,245-263,269-275,295-301303^311- 

}20324^340^351-d59375^1 


C:1,E:2, 
1:1 


111-198 


40,184 


SP0914


nodulin-related
protein, truncation


14-2530-42,47-61,67-7531-91,98-106,114-122,124- 

135,148-193,209-227 


L2 


198-213 


41,185 


SP0930 


choline binding
protein E (cbpE)


5-18,45^5032-90,97-114,116-136,153-161,163-171,212- 

Z19321-227,240-249,267-281311-317328-337375- 

381390-395,430-436y449-455,484-49533&543348- 

554356-564380-586396-602 


E:4, G:2, 
E1:1,I:2, 


493-606 


42,186 


SP0943


Gid protein (gid) 


?-25,28-3437-44>61-68,75-8l38-96,98-lll,119-133,138- 

150,152-163,168-182,186-194,200-205,216-223;236- 

245357-264^279-28733-304311-318325-330340- 

M6353-358365-379399-409,444-453 




303-391 


43,187 


SP0952


alanine
dehydrogenase,
authentic frameshift
(ald)


16-3635-6136-76,78-102,121-130,134-146,150-21:^221- 
239355-276389-322329357 


Gi3,H:4 


29^9 


44,188 


SP1003


Conserved
hypothetical protein
(PAT)


8-2738-74,77-99,110-116,124-141,171-177302-217321- 

228359-265375-290,293-303309-325335-343345- 
351365-379384^94,406-414,423-437/452-465,478- 
507325-534354^0,611-624,628-651,669-682,742- 
747,767-778,782-792304-812320-836 


A:2,B5, 

C:8, D5, 
B:13,F:3, 


79-231 
359-451 


45,189 


SP1004


Conserved 
hypothetical protein 


5-2839-4536-6237-74,77-99,110-117,124-141,168- 

176300-230337-244368-279,287-299304-326329- 

535348-362370-376379-384390^06,420-429,466- 

471,479-489^5^504329-541345-553361-577398- 

S04322-630337-658372^382-688390^6398- 

709,712-719,724-736,738-746,759-769380-786396- 

804313-818360-877,895-904,981-997,1000-1014,1021- 

1029 


A:5,B:4, 
D4,D:9, 
E:12,F:4, 

L:l 


1-162 

206-224 

254-350 

414r514 

864-938 


46,190 


SP1124


(glgA) 


1-1149^.56-66^101409-116,123-145456-165,177- 
185;204-221,226-234^42-248,251-256,259-265^- 
302^7-330^a349,355-374;377-383,392-400,422- 
428,434-442,462-474 


M:l 


266-^ 


47,191 


SP1154 


IgA1 protease


14-43,45-57,64-74^0-87406-127,131-142,145-161,173- 

180,182-1883»-210,213-219,221-243;Z45-254304r 

311^14^,342-^,354365,372-^78,394-399,407- 

431,436-448,459-465,470-477,484490,504-509,531- 

537,590-596,611-617,642-647,723-734,740-751,754- 

76?,764-774,782-797,807-«1^824-831,838-845377- 

885,892-898,900-906,924-935,940-946,982-996,1006- 

1016,1033-1043,1051-1056,1058-1066,1094-1108,1119- 

1126,1129-1140,1150-11574167-1174,1176-1185,1188- 

1201,1209-1216,1220-1228,1231-1237,1243-1248,1253- 

1285,1288-1297,1299-1307,1316-1334,1336-1343,1350- 

1359,1365-1381^1390-1396,1412-1420,1427-14394452- 

1459,1477-1484,1493-15124554-1559,1570-1578,1603- 

1608,1623-1630,1654-16594672-1680,1689-1696,1705- 

1711,1721-1738,1752-1757,1773-1780,1817-1829,1844- 

1851,1856-1863,1883-18^4950-1958,1974-1990 


A:6,Bi2, 

C^,Eh3, 

E:4,F:2, 

G:6,H:4, 

fcl3,L:12 


172-354 
384-448 
464-644 
648-728 
1357-1370 


48,192 


SP1174 


conserved domain 
protein (PAT) 


8-27,68-74,77-99,110-116,124rl41,169-176,201-216,220- 

227,258-264,274-289,292-302,308^24334-34234- 

350,364^72377-387,399-407,416-429,445-458,471- 

481,483^318-527347-553,604-617,621-644,662. 

675,767-778309-816 


B:14, 017, 
D:6,E:18, 
F:16,L1, 
ECt5, Lil, 
M:8 


15-307 

350-448 

496-620 


49,193 


SP1175


conserved domain
protein


4-17,24-2933^i62-84,109-126459'164,189-204>208- 

219,244r249,274-290,292-302308.324334-342344- 

350378-389391^97,401-409,424-432,447-460/470- 

479,490-504321n529338-544349-555370-577383- 

592,602^,615-630,635^364-677,692-698,722- 

731,733-751,782-790,793-799 


A:1,B:4, 
C'3, 

E:9,F:2, 
H:2,M:4 


56-267 

337-426 

495-601 


50, 194 


SP1221 


type II restriction
endonuclease


12-2249h59,77«,111-121,136-148,177.186,207- 

213,217.225,227-253,259-274>296-302328-333343- 

354374^,424-446,448-457,468-480,488-502307- 

522344^353-560364-572387-596,604^14319- 

S25329H635338-656362.676380-692397-713720- 

738,779-786333-847361-869380-895397-902,911- 

917,946-951,959-967,984.990,992-1004,1021-1040,1057- 


HtXf 
K:1,L:4 


381-403 


51, 195 






1067,1073-1080 








SP1227 


DNA-binding
response regulator


4-10^31,46^6,60-66,70-79,86-94>96-lQ2,109-118,132- 
152,164-187,193-206;ai7-224 


E:1,L:3 


Bl-149 


52,196


SP1241


amino acid ABC
transporter, amino
acid-binding pro


4-2i;26-37,48^,71-82409-117,120-128430-136,142- 
i47,l81-187^03-211^16-223^47-255;257-284316- 
325^73-379^95-400,423-435,448-456,479-489^12- 
576,596-625,641-678,680-688.692-715 


3i2^ Ol, 
^Ll 


346^ 


53,197 


SP1287 


signal recognition
particle protein (ffh)


l(>-16;25^1,34-56,58-69,71-89,94r.ll0,133-176,186- 
193,208-225,240-250,259-266,302^,335-341^76- 
383,410416 


1B:8,G:8, 
H:3,M:1 


316-407 


54,198 


SP1330


acetylmannosamine-
6-P epimerase,
putative (nanE)


ll-29/£2-^,60-7532-88,95-n0,116-126,132-143,145- 
160A66-172^184r216 


L:45 


L23-164 


55, 199 


SP1374


Chorismate synthetase
(aroC)


^l-29,54^,110-117,139-15^158-l66,172-180A86- 
193^15-236;^40-25132-d23,330-335,340-347,350- 
366,374^1 


G:1,L:29, 
M-14 


252-299 


56,200 


SP1378


conserved 

hypothetical protein 


18-2735^,50-56,67-74412-136,141-153,163-171,176- 

189,205-213,225-234>241-247,253-258,269-281,288- 

298,306-324326-334355^69,380-387 


H:2 


289-320 


57, 201 




peptidase, U32 family


7-15,19-4136-72,91-112,114-122,139-147,163-183,196- 

209,258-280326^8357-363391^,406-416 


BA 


360-378 


58,202 


SP1478 


oxidoreductase,
aldo/keto reductase
family


ll-18,29-41,43-49,95-108,142-194,204-212,216-242^,247- 
256,264-273 


H:ll 


136-149 


59,203 


SP1518 


conserved
hypothetical protein


18-243340,65-7939-102,113-119,130-137,155-161,173- 

179,183-203,^219r223-231,245-26X267-274;»6- 

306311-321330^1344.363369-381/«)1-408,415- 

427,437444,453-464,472-478,484-508317-524326- 

532343-548 


A:10,B:4, 
G:5,H:1 


59-180 


60,204 


SP1522


conserved domain
protein


5-1332-65,67-73,97-110,112-119,134-155 


B:4C:6, 
E:1,H:7, 


45-177 


61,205 


SP1527 


oligopeptide ABC 
transporter 


6-28344337-67,75^1,111-128,132-147,155-163,165- 
176,184-194,208-216,218-229,239-252,271-278328- 
334363^76381-388,426-473,481-488,492-498307- 
513336-546364-582390^1.607-623 


A:1,B:1, 
C:4,F:1, 
G:26, 
H:18,I:10, 


148-269 
420-450 
610^ 


62,206


SP1573


lysozyme (lytC)


4-12,20-38^9-75,83^,123-128445-15^154rl61483> 

188;200-213,245-250,266-27^306-312332^9^57- 

369^3-389^9&402,437-453,455-470,497-503 


A:40,B:27, 
C:24,Eh2, 
E:6, G:ll, 
K:l 


1-112 


63,207 


SP1604


hypothetical protein


3&59,74-«6411-117,122-137 


All/ 03, 
E:1,G:1, 
tl 


70-154 


64,208 


SP1661 


cell division protein
DivIVA 


26-42,54-61,65-75,101-107,123-130,137-144,148. 
156,164-172,177-19^213-221^1-258 


Ea 


157-249 


65,209 


SP1664


ylmE protein (ylmP) 


29-38,61-67,77-87,94-100,105-111,118-158 


B:1,C42, 
1:3 


1-97 


66,210 


SP1676 


N-acetylneuraminate
lyase, putative


7-21,30^,51-58,60-85,94'123,134-156460-167,169- 
183,186-191,216-229,237-251,257-267,272-282,287-298 


H:2 


220-243 


67,211 


SP1687 


neuraminidase B 
(nanB) 


6-29,3447,56-65,69-76,83-90,123-134,143-151,158- 
178,197-203,217-235,243-263,303^309,320-333,338- 
348,367-^,387-^93,407-414>416-427^-457,473- 
482/i87-499,501^,514-520,530^,577-583,590- 
602,605-612,622-629,641-670,678-690 


B3,E:2, 
L:1,M:2 


37-71 
238^7 


68,212 


SP1693 


neuraminidase A 
(nanA) 


7-40,121-132,148-161,196-202,209-215,221-235,248- 
255,271-280,288-295,330-339,395-409,414-420,446- 
^l,^&487356-563,568-575,580-586,588-595,633- 
638,643^,652-659,672.685,695-700,710-716,737- 
742,749-754,761-767,775-781,796-806,823-835,850- 
863,884-890,892-900,902-915,934-941 


C:3,D:5, 
E:3,F:1, 
G:7,H:1, 

U4 


406-521 


69,213 


SP1732


serine/threonine 
protein kinase 


9-18,24-46,51-58,67-77,85-108,114-126,129-137,139- 

146,152-165,173-18?,188-195,197-204,217-250,26D- 

274;£96-313,343-366,368-d84427-434r437.446,449- 

i55,478-484,492n506,522-527,562n591,599-606,609^ 

618,625-631,645-652 


E:^H:1 


577-654 


70,214 


SP1735 


methionyl-tRNA
formyltransferase

(fmt)


13-20,26^7,41-53,56-65,81-100,102-114,118-127,163- 
188,196-2Q^1.238,245-252,266-285,293-298;301-^06 


K:13,M:13 


19-78 


71,215 


SP1759


preprotein
translocase, SecA
subunit (secA-2)


10-23,32-4?,54^,73-91,106-113,118-127,139-15?,164- 

173,198-207,210-245,284^00,313-318330-337,339- 

346,354-361,387^93,404-426,429-439441-453,467- 

473,479-485,496-509,536-544,551-558,560.566,569- 

574,578-588,610.615,627-^^-675,679.690,698- 

716,722-734,743-75^769-780,782-787 


M:2




72,216


SP1772 


cell wall surface 
anchor family protein


6-39,42-50,6M8,76-83,114rl29,147-16247M89497- 

205;jl7-23i;»9-248,299<305,338-344352-357371- 

377,380-451,459-483,491-499^-523337-559^7- 

613,625^1,689-729,737-781,785^,817-865373- 

881,889-939,951-975,983-1027,1031-1055,1063- 

1071,1079-1099,1103-1127,1151-1185,1197-1261,1269- 

13094317-1333,1341-1349,1357-1465,1469-1513,1517- 

1553,1557-1629,1637-1669,1677-1701,1709-1725,1733- 

1795,1823-1849,1861-1925,1933-1973,1981-2025,2029- 

2053,2061-2109,2117-2125^133-2183,2195-2219,2227- 

2271,2275-2299,2307-2315,2323-2343,23^-2371,2395- 

2429,2441-2529,2537-2569,2577-260i;2609-2625,2633- 

2695,2699-2737,2765-2791,2803-2867,2889-2913,2921- 

2937,2945-2969,2977-2985,2993-3009,3023-304S,3073- 

3099,3111-3167,3175-3215/3223-3267,3271-32953303- 

33513359-33673375-34253437-34613469-35133517- 

35413549-35573565-35853589^6133637-36713683- 

37473755^7953803-38193827-38353843-39513955- 

3999,4003-4039,4043-4115,4123-4143^147-4171,4195- 

4229^4241-4305,431343534361-4377,4385-43934401- 

45094513-4557,4561-45974601-47184749-^68 


B:9, C:1,
D:1,F:13,
G:1,H:3,
I:1, L:1,
M:2 


74-171 

452-^ 

2951^61 


73,217 


SP1804 


general stress protein
24, putative


16-2230-51,70-111,117-130,137-150,171-178,180- 
188,191-196 


b4 


148-181 


74,218 


SP1888 


oligopeptide ABC 
transporter, ATP-

binding protein AmiE 


6-1931-4630^,80-86,118-126,167-186,189-205,211- 
242344-267373-286390-297307-316320-341 


a-1 


34^ 


75,219 


SP1891


oligopeptide ABC
transporter,


5-2633-43,48^5438-63,78-83,113-120,122-128,143- 
15^157-175,185-192311-225,227-234344-256370- 
281384-290304^10330-337348^362-379384- 
394^9-445450-474483-490311-520337-546348- 

554361-586390-604,613-629 


A:2,B:3, 
E:1,F:1, 
G:13,H:8 


149-186 
285-431 
573-659 


76,220 


SP1937 


Autolysin (lytA)


5-26,49-59,61-6733-91,102-111,145-157,185-192,267- 
2^379-286392-298306-312 


D:3, F:1,
G:1,H:2, 
K:11,M:1 


134-220 
235-251 
254-280 


77,221 


SP1954 


serine protease,
subtilase family,
authentic frame 


5-19,72-7933-92,119-124,140-145,160-165,167-182,224- 

232340-252359-270301-310313-322332-343347- 

367384^398416-^,431-446,454-461 


0:43, E:6, 
L4>K:21, 
L:50 


1-169 


78,222 


SP1980 


cmp-binding-factor 1


8-1736-3136^75-83,93-103,125-131,135-141,150- 


Ei:9 


127-168 


79,223




(cbf1)


194>205-2l7,233-258^-26a31-286 








SP1992 


cell wall surface 
anchor family protein 


6-lW9-75408-115439-lS9476-18il94-214 


B:5,C:1,
F:4,I:1


46-161 


80,224 


SP1999 


catabolite control
protein A (ccpA)


6^13,18-27^9-48^1-59^73,79-85,95-101409-116,118- 
124,144-164,166-177483-193,197-204.215-223,227. 

236^42-249,252-259,261-270,289-301^18-325 


D:2


12-58 


81,225 


SP2021


glycosyl hydrolase 


4-10;26-3248-60,97-105,117-132,138-163,169-185,192- 

214^19-231,249-261,264-270,292-308,343-356,385- 

392^98-404^8-417,435-441 


L:3 


24-50 


82,226


SP2027


Conserved 
hypothetical protein 


1040,^-48,51-61,119-126 


A:1,E:1, 
G:19, 
H:1^L16, 
L:5 


1-118 


83,227 


SP2039


conserved 
hypothetical protein 


5-17>4O58,71-83,103-lll,123-140,167-177,188-204 


G:1,L:3 


116-128 


84,228 


SP2048


Conserved 
hypothetical protein 


4-9,11-50,57-70,112-123,127-138 


L1,L:4 


64-107 


85,229 


SP2051


Competence protein
CglC 


9-393M7 


0.1,0:3, 
I:8,L:26 


1-101 


86,230 


SP2092


UTP-glucose-1-
phosphate
uridylyltransferase
(galU) 


5-14,17-25,28-46,52-59,85-93,99-104,111-120,122- 
131,140-148,158-179,187-197,204-225,271-283,285-293 


H:2 


139-155 


87,231 


SP2099 


Penicillin binding
protein 1B


42-70,73-90,92-108,112-127,152-164,166-172,181- 

199,201.210,219-228,247-274,295^Q?322-334336- 

346/353^,396-414,419-425/432-438,462-^,518- 

523,531^,561^7,576-589,594-612,6»)-631,665- 

671,697-710,718-731,736-756,765-771,784-801 


A:1,B:9,
C:11, D:1,
E:6,F:1,
H:4,K:1


626-653 


88,232 


SP2108


Maltose ABC 
transporter 


8-28/41-51,53-6^68-^74,79-85,94-100,102-108,114- 

120,130-154,156-16^175-180,198-204>206-213,281- 

Z94>308^18,321-339,362-368,381-^,393^99,407-415 


G:10,H:1,
L:10,M:1 


2-13 


89,233 


SP2120


hypothetical protein 


4-39,48-65,93-98,106-112,116-129 


I:2


10-36 


90,234 


SP2128 


transketolase, N-
terminal subunit 


25-3235-50,66-71,75-86,90-96,123-136,141-151,160- 
179,190-196,209-215,222-228,235-24^,257-263,270-280 


H:2 


209-247


91,235 


SP2136 


choline binding 
protein PcpA


5-29,31-38,50-57,62-75,83-110,115-132,168-195,197- 
206,216-242;,249-258,262-269,333^,342n350,363- 
368,376^92,400-406/410-421/423-430^-442^ 
454,460-466,471-476/^-496,511-516,531-536,551- 


C3,R1, 
G:24, 
Eia2,I:13, 
K:177, 


27-70 
219-293 
441-504 
512-584 


92,236






556^71-576,585«t599-605 


L:34rM:l8 






SP2141 


glycosyl hydrolase-
related protein


4rlZ14n3447-75^104.107-115,133-140,148-185487- 

196^-212;224-256,258-265^1-287^9-29638- 

308^25^,345^^371^-395>424-435Ml- 

457,465-472,483-491,493-505,528-534,536-546^- 

558,575'584^9^,616-623 


L:3 


576-591 


93,237 


SP2180


conserved 

hypothetical protein 


4-76, 78-89, 91-126, 142-148, 161-191, 195-208,
211-223, 226-240, 256-277, 279-285, 290-314, 317-
323, 35M77, 381-387, 391-396, 398-411, 415-434,
436-446, 454-484, 494-512, 516-523, 53&«2, 559-
566, 571-577, 579-596, 599-615, 620-627, 636-644,
694-707, 720-734, 737-759, 761-771


t3 


313-329 


94,238 


SP2190


choline binding 
protein A (cbpA) 


5^-38,44-49,79-89,99-108,117-123,125-132,137-146,178- 

187^-237;245-255,322-337,365-387398^08,445- 

462,603^.623^28>44-650,657-671,673-679 


A:6, B:12, 

C:9,D;6, 

E;30,F:8, 

a*65, 

a72,I:76, 

EC:222, 

L.-99,M:37 


111-566 


95,239 


SP2194


ATP-dependent Clp
protease, ATP-
binding subunit 


5-20,22^,39-45,58-64,77-117,137-1441,158-163,205- 

210,218-224,229-236.239-25X263-277^99^,323- 

334353-384388-396^9-438^443-448458-463,467- 

478,481-495,503-509,511^26359-576,595-600,612- 

645,711-721,723-738,744-758,778-807 


H:1


686-720 


96,240 


SP2201


choline binding 
protein D (cbpD) 


10^35-41/5^-84,129-138,158-163^03-226,243- 
252,258-264;279-302322-329381-d86,401-4D6,414435 


B:4.C:3, 

D:1,E:7, 

P:1,G:1, 

H:2,K:26, 

Mil 


184-385 


97,241 


SP2204


ribosomal protein L9 


4-9,19-24,41-47,75-85,105-110,1X3-146 


H:3,U4 


45-62 


98,242 


SP2216


secreted 45 kd protein
- homology to glucan
binding protein
(GbpB) S. mutans


4-25,52-67,117.124,131-146,173-180,1^-191^195- 

206,215-22i;229-236,245-25^258-279,286-291,293- 

302314-320327-336341-353355-361383^9 


A'130. 

8:414, 

C:450, 

D:162, 

E:166, 

F284, 

G.-90, 

fl:16,l:4, 

EClO, L:29, 


1-285 


99,243








^•11 






SP-NRF1


Choline binding
protein 


l«^38-50,73^93-105409-114 


Fll 


10-70 


100,244 














ARF0408


Hypothetical protein


5-26 


L:3 


22-34 


101, 245 


ARF0441


Hypothetical protein 


23-28 


H:3 


13-39 


102,246 


ARF0690 


Hypothetical protein


5-14 


L'JZ 


21-34 


103,247


ARF0878


Hypothetical protein


^13^29^44^^-74 


H:3 


41-69 


104,248 


ARF0921 


Hypothetical protein 


4-9, 19-4?, 48^9, 71-83 


M<4 


57-91 


105,249


ARF1153 


Hypothetical protein 


4-14 


M:7 


10-28 


106,250 


ARF1515 


Hypothetical protein 


22-28^2-42,63-71^1-111,149-156,158-167,172-180,182- 
203,219-229 


G:4,H:5 


27-49 


107,251 


ARF1519


Hypothetical protein


17-27 


^3 


23-32 


108,252 


ARF1905 


Hypothetical protein 


18-24 


H:2 


28-38 


109,253 


ARF2044


Hypothetical protein 


9-15 


G:2,H:5 


13-27 


110,254 


ARF2155 


Hypothetical protein 


13-22 




18-29 


111,255 


ARF2199


Hypothetical protein


17-26 




2-11 


112,256 


CRF0129


Hypothetical protein 


4-33


L:4 


16^2 


113,257 


CRF0200 


Hypothetical protein 


Srl0;37-43,54^92.127 


H:5, L:1


15-62 


114,258 


CRF0236 


Hypothetical protein 


4..14,20-3235-60,69-75,79-99,101-109,116-140 


L:3 


124-136 


115, 259 


CRF0394 


Hypothetical protein 


aone 


H:7 


2-13 


116,260 


CRF0408 


Hypothetical protein 


4-13,28-42 


L:11


42-57 


117, 261 


CRF0430


Hypothetical protein 


4-14,27-44 




14^ 


118, 262 


CRF0498


Hypothetical protein 


4-12 


H:4 


1-27 


119,263 


CRF0519 


Hypothetical protein


4-18,39-45,47-74 


G:5,It3 


35^ 


120,264 


CRF0573 


Hypothetical protein 


8-20,43-77 


t3,L:9 


17-36 


121,265 


CRF0713 


Hypothetical protein 


4^,35-45,51-57 




35-49 


122,266 


CRF0722 


Hypothetical protein 


4-24>49-67 


G:18 


15^34 


123,267 


CRF0764 


Hypothetical protein 


4-22 


L:4 


B-27 


124,268 


CRF1079 


Hypothetical protein 


13-25,32-59,66^ 




21^ 


125,269 


CRF1248 


Hypothetical protein 


4-10/24-33,3&42,54^,72-«2,98-108 


EX.4 

n:l 


U>-3U 




CRF1398


Hypothetical protein 


8-19 


H:1,L^3 


17-47 


127,271 


CRF1412 


Hypothetical protein


12-18,40-46 


US 


31-52 


128,272 


CRF1467 


Hypothetical protein


4-20,35-78,83-102,109.122 


1:4 


74-86 


129, 273 


CRF1484 


Hypothetical protein 


7-17,21-41,46-63 


ts 


2-20 


130,274 


CRF1587 


Hypothetical protein 


30-37 


G3,H:3, 


2-33 


131,275








L:4 






CRF1606 


Hypothetical protein 


4-1347-25 


U3 


1-15 


132,276 


CRF1623 


Hypothetical protein






20-51 


133, 277 


CRF1625 


Hypothetical protein 


20.30 


UIO 


5-23 


134,278 


CRF1640 


Hypothetical protein 


13-3348-71 


[5 


92-110 


135, 279 


CRF1702 


Hypothetical protein 


4-9,50-69,76-88,96-106413-118 


L:6 


12-34 


136, 280


CRF1825


Hypothetical protein


4-24 


L:11


6-26 


137,281 


CRF1883 


Hypothetical protein 


7-26 


H:61, U77 


14^0 


138,282 


CRF1991 


Hypothetical protein 


9-39,46-68,75-8^84-103 


a-6,L:2 


26-44 


139,283 


CRF1992


Hypothetical protein 


4-30;33-107 


M:7 


58-84 


140,284 


CRF2004 


Hypothetical protein 


4-12 


L:3 


9-51 


141,285 


CRF2030


Hypothetical protein 


12-18,2937 


HS,L:1, 

Mil 


6-37 


142,286 


CRF2065


Hypothetical protein 




LI, M;6 


16-37 


143, 287 


CRF2232 


Hypothetical protein 


9-19 


L:3 


2-30 


144,288


Table 2. Immunogenicity of epitopes in peptide ELISA



Peptides 


P1


P2


P3


P4


P5


P6


P7


P8


P9


P10


P11


P12


no 


N1 


N2 


N3 


N4 


N5 


N6


N7 


N8 


N9 


N10 


s 


from 


to 


Seq 
ID 


ARF0408.1 


nd 










nd 




nd 
































4^ 
10 




Or 


245


ARF0441.1 


nd 










nd 




nd 
































9ft 


Q 

o 


£.( 






nd 










nd 




nd 
































90 


lU 


07 
c.t 




ARF0878.1 












nd 












nd 


nd 


MB 


















AO 






ARF0878.2 












nd 












nd 


nd 






















44 

n 




D9 


248 


ARF0921.1 












nd 












nd 


nd 






















41% 


OO 


An 


249


ARF0921.2












nd 












nd 


nd 






















4A 
1** 


7A 


Q1 




ARF1153.1


nd 










nd 




nd 
































40 


41 


Oft 


o'ln 


ARF1515.1 


nd 










nd 




nd 
































4R 


CO 




251


ARF1519.1 


nd 










nd 




nd 
































4 A 
10 


4 f% 






ARF1905.1


nd 










nd 




nd 
































94 
a1 


A 

** 




^oo 


ARF2044.1 


nd 










nd 




nd 
































4fi 


lU 


97 




ARF2155.1 


nd 










nd 




nd 
































4K 

10 


4T 

1 f 






ARF2199.1 


nd 






: 




nd 




nd 
































4D 


4 


4ft 




CRF0129.1 


nd 










nd 




nd 
































4e 

10 


4 A 
ID 






CRF0200.1 


nd 










nd 




nd 
































12 


ACS. 
ID 


OD 


258


CRF0200.2 


nd 










nd 




nd 
































9 


OA 


4y 


258


CRF0200.3 


nd 










nd 




nd 
































10 


43 


62 


258


CRF0236.1 












nd 












nd 


nd 














_ 








19 


122 


139 


259


CRF0394.1 


nd 










nd 




nd 
































20 


1 


18 


260 


CRF0408.1 


nd 










nd 




nd 




























" 




19 


41 


58 


261 


CRF0430.1 


nd 










nd 




nd 
































15 


15 


36 


262 


CRF0498.1 


nd 










nd 




nd 
































21 


2 


27 


263 


CRF0573.1 


■ 








nd 












nd 


nd 




















22 


18 


36 


265 


CRF0713.1 


nd 










nd 




nd 






















m 






20 


34 


51 


266 


CRF0764.1 


nd 






nd 




nd 
































16 


9 


27 


268 


CRF1079.1


nd 








i 


nd 




nd 




























27 


22 


47 


269


CRF1398.1


nd 










nd 




nd 
































24 


4 Q 

lO 


oo 


271


CRF1398.2 


nd 






nd 




nd 





m 




























21 




/IT 
HI 


271


CRF1412.1 












nd 












nd 


nd 






















9 


32 


52 


272 


CRF1467.1


nd 










nd 




nd 






















m 














O0 


273


CRF1484.1 


nd 










nd 




nd 






























3 


OA 


274 


CRF1587.1


nd 










nd 




nd 
































Z3 


3 


Q4 


275


CRF1587.2


nd 










nd 




nd 
























II 








21 


15 


OO 


275


CRF1606.1


nd 










nd 




nd 
































22 


1 


18 


276 


CRF1625.1 


nd 










nd 




nd 




„ 

ml 




























23 


6 


23 


278 


CRF1640.1


nd 










no 




nd 




























m 




4ft 
lO 




1 in 


279


CRF1702.1 


■ 








nd 












nd 


nd 






















18 


13 


34 


280 


CRF1825.1 


nd 










nd 




nd 






























24 


7 


26 


281 


CRF1825.2 


nd 










nd 




nd 






























24 


9 


26 


281 


CRF1883.1 


nd 










nd 




nd 
















■ 














20 


16 


33 


282 


CRF1991.1 


nd 










nd 




nd 
































24 


27 


44 


283 


CRF1992.1 


nd 














nd 






nd 




m 






















19 


67 


84 


284 


CRF2004.1 


nd 










nd 




nd 
































20 


10 


33 


285 


CRF2004.2 


nd 










nd 




nd 
































22 


26 


50 


285 


CRF2030.1 


nd 








m 


nd 




nd 
































21 


7 


25 


286 



WO 2004/092209



PCT/EP2004/003984 






CRF2030.2


nd 










nd 




nd 










■". • 


















■ "> • 




20 


19 


37 


286 


CRF2065.1 


nd 










nd 




nd 


- 




























21 


17 


37 


287 


CRF2232.1 


nd 




■ 




nd 




nd 






























23 


3 


20 


288 


CRF2232.2


nd 










nd 




nd 
















n 










m 




19 


13 


30 


288 


SP0008.1 






















nd 


nd 


nd 




















28 


62 


80 


145 


SP0008.2 






















nd 


nd 


nd 




















21 


76 


93 


145 


SP0069.1 












nd 












nd 


nd 






















17 


92 


108 


147 


SP0071.1 


nd 










nd 




nd 
































15 


332 


349 


148 


SP0071.2 












nd 












nd 


nd 






















13 


177 


200 


148 


SP0071.3 












nd 












nd 


nd 






















4 


1755 


1777 


148 


SP0082.1 












nd 












nd 


nd 






















4 


109 


133 


149 


SP0082.2 


nd 










nd 




nd 
















III 
















24 


149 


174 


149 


SP0082.3 


nd 






H 




nd 




nd 
































26 


260 


285 


149 


SP0082.4 










nd 












nd 


nd 






















20 


460 


485 


149 


SP0107.1


nd 














nd 






nd 


























10 


26 


47 


150 


SP0107.2 


nd 














nd 






nd 


























11 


42 


64 


150 


SP0117.1 






















nd 


nd 


nd 












it 






m 




18 


22 


41 


151 








mm 


mm 








57 


35 


54 


151 










III! 




64 


115 


130 


151 


SP0117.4 








■I 














nd 


nd 


nd 


W 


d 


















17 


306 


325 


151 


SP0117.5 






















nd 


nd 


nd 






K 
















IS 


401 


420 


151 


SP0117.6




nd


nd 


nd 








68 


454 


478 


151 


SP0222.1 












nd 












nd 


nd 






















15 


22 


45 


155 


SP0368.1 












nd 












nd 


nd 






















22 


156 


174 


160 


SP0368.2 


il 










m 










nd 


nd 


nd 




m 


fi 












32 


924 


940 


160 


SP0368.2b 












nd 












nd 


nd 






















12 


1485 


1496 


160 


SP0368.3 












nd 












nd 


nd 






















14 


1447 


1462 


160 


SP0368.4 












nd 












nd 


nd 






















11 


1483 


1498 


160 


SP0369.1 


III 










nd 












nd 


nd 






















16 


457 


475 


161 


SP0377.1 












nd 












nd 


nd 






















11 


302 


325 


163 


SP0378.1 












nd 




m 








nd 


nd 




















m 


17 


288 


305 


164 


SP0390.1 












nd 












nd 


nd 






















11 


244 


266 


165 


SP0390.2












nd 




nd 
































24 


260 


282 


165 




SP0466.1 
SP0468.1 
SP0468.2 
SP0498.1 
SP0498.2
SP0498.3
SP0498.4
SP0498.5 
SP0498.6 
SP0498.8 



■ 



IB 



— 
nd 

nd

nd

nd

nd 

nd 



"iSS 



7^ 



m 



39
54

70 
1227 
1539 
1554 
1569
1584
1242 
1272 



64
76

92 
1247 
1559 
1574 
1589 
1604 
1262 
1292 



168 
169 
169 
170 
170 
170 
170 
170 
170 

170
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SPOBdl 6 








nH 

IIU 










nd 




nd 
































18 


2092 


2111 


177 


SP0648.1 


nd 










nd 




nd 
































11 


1528 


1547 


178 


SP0648.2 


nd 










nd 




nd 
























II 








24 


1541 


1560 


178 


SP0664.1 


nd 




■ 




nd 




nd 








■ 
























28 


184 


200 


179 


SP0664.2 


nd 










nd 




nd 
































18 


367 


388 


179 


SP0664.3 


nd 










nd 




nd 


■ 












- 
















15 


382 


403 


179 


SP0664.4


nd 










nd 




nd 
































14 


409 


429 


179 


SP0664.5


nd 










nd 




nd 




















Ii 












20 


425 


444 


179 


SP0664.6 


nd 










nd 


i 


nd 








■ 






















17 


438 


457 


179 


SP0667.1 


nd 














nd 






nd 




m 






















22 


27 


50 


180 


SP0667.2 


nd 














nd 






nd 


























16 


45 


67 


180 


SP0749^ 


nd 










nd 


■ 


nd 












■ 














■ 


24 


114 


131 


182 


SP0770.1 


nd 






m 




nd 




nd 




















m 












17 


405 


419 


183 


SP0785.1 


nd 














nd 






nd 


























11 


113 


134 


184 


SP0785.2 


nd 














nd 






nd 


























10 


129 


150 


184 


SP0785.3 


nd 














nd 






nd 


























8 


145 


166 


184 


SP0785.4 


nd 














nd 






nd 


























11 


161 


182 


184 


SP0785.5 


nd 














nd 






nd 


























8 


177 


198 


184 


SP0930.1 


nd 








m 


nd 


m 


nd 


m 






m 














m 


m 






m 


30 


495 


515 


186 


SP0943.1 


nd 










nd 




nd 
































19 


346 


358 


187 


SP1004.1


nd 






n 


m 


nd 




nd 


























■ 


21 


208 


224 


180 


SP1154.1


nd 






nd 


111 


nd 










i 






















33 


178 


194 


192 


SP1154.2


nd 










nd 




nd 




m 


























28 


202 


223 


192 


SP1154.3 


nd 










nd 




nd 
































21 


217 


238 


192 


SP1154.4 


nd 










nd 




nd 
































19 


288 


308 


192 


SP1154.5


nd 










nd 




nd 
































16 


1355 


1372 


192 


SP1175.1 


nd 


■ 




nd 




nd 


1 












1 




57 


78 


194 


SP1241.1 


nd 














nd 




nd 




















m 


26 


347 


369 


197 


SP1241.2 


nd 










IM 




nd 






nd 
























18 


364 


386 


197


Table 3: Gene distribution in S. pneumoniae strains.



ORF


Common name


Gene distribution
(present of 50)


Amino acid
substitutions (in
serotype 14 strain)*


Homology 0


Seq.
ID (DNA,
Prot.)


SP0008


hypothetical protein


n.d.


n.d.




1,145 


SP0032


DNA polymerase I (polA)


n.d. 


n.d. 




2, 146 


SP0069


Choline binding protein I 


7 


0/166# 




3, 147 


SP0071


immunoglobulin A1 protease (iga-1)


7 


0/477# 




4, 148 


SP0082 


Cell wall surface anchor 


50 


5/385 




5, 149 


SP0107 


LysM domain protein 


50 


1/173 




6, 150 


SP0117 


pneumococcal surface protein A 
(pspA) 


n.d.


n.d. 




7, 151 


SP0191 


hypothetical protein 


n.d.


n.d. 




8,152 


SP0197 


dihydrofolate synthetase, putative 


n.d. 


n.d. 




9,153


SP0212


Ribosomal protein L2 


50 


0/232 




10,154


SP0222 


Ribosomal protein S14 


n.d. 


n.d. 




11,155 


SP0239 


Conserved hypothetical protein 


n.d. 


n.d. 




12,156


SP0251 


formate acetyl transferase, putative 


n.d.


n.d. 




13,157 


SP0295 


ribosomal protein S9 (rpsI)


50 


1/121 




14,158 


SP0330 


sugar binding transcriptional
regulator RegR


n.d.


n.d. 




15, 159 


SP0368


cell wall surface anchor family
protein 


46 


4/422# 




16,160 


SP0369


Penicillin binding protein 1A


50 


1/346 




17, 161 


SP0374 


hypothetical protein


n.d.


n.d.




18, 162 


SP0377 


Choline binding protein C 


29 


0A14 




19,163 


SP0378


choline binding protein J (cbpJ)


50 


2/104 




20,164 


SP0390


choline binding protein G (cbpG) 


50 


3/171# 




21,165


SP0454 


hypothetical protein 


48 


1/303#




22,166 


SP0463


cell wall surface anchor family
protein 


10 


0/298# 




23,167 


SP0466


sortase, putative


44 


4/243# 




24,168 


SP0468


Sortase, putative


18 


0/254# 




25, 169 


SP0498


endo-beta-N-acetylglucosaminidase,
putative 


50 


4/334 




26,170 


SP0509


type I restriction-modification
system, M subunit 


n.d. 


n.d. 




27,171 


SP0519 


dnaJ protein (dnaJ)


50 


2/312 




28,172 


SP0529


BlpC ABC transporter (blpB) 


50 


6/306 




29,173 


SP0564


hypothetical protein 


50 


1/127 




30, 174


SP0609 


amino acid ABC transporter, amino
acid-binding pro


50 


0/232 




31,175


SP0613


metallo-beta-lactamase superfamily
protein 


n.d. 


n.d. 




32,176


SP0641


Serine protease 


n.d. 


n.d. 




33,177 


SP0648


beta-galactosidase (bgaA)


50 


0/304 




34,178


SP0664


Zinc metalloprotease ZmpB, putative 


n.d. 


n.d.




35, 179 


SP0667


pneumococcal surface protein,

putative 


45 


18/297 




36,180 


SP0688


UDP-N-acetylmuramoylalanine--D-
glutamate ligase


n.d.


n.d. 




37, 181 


SP0749 


branched-chain amino acid ABC
transporter 


50 


4/303 




38,182 


SP0770 


ABC transporter, ATP-binding
protein 


50 


0/307 




39,183 


SP0785 


conserved hypothetical protein


50 


0/304 




40, 184 


SP0914 


nodulin-related protein, truncation 


n.d. 


n.d. 




41, 185


SP0930


choline binding protein E (cbpE) 


47 


17/294 




42, 186 


SP0943


Gid protein (gid) 


n.d. 


n.d. 




43,187 


SP0952


alanine dehydrogenase, authentic
frameshift (ald)


n.d.


n.d. 




44, 188


SP1003 


conserved hypothetical protein (PAT)


n.d. 


n.d. 




45, 189 


SP1004 


Conserved hypothetical protein 


n.d.


n.d.




46, 190 


SP1124 


glycogen synthase (glgA) 


n.d. 


n.d. 




47, 191 


SP1154 


IgA1 protease


28 


13/470; 80 missing




48,192 


SP1174


conserved domain protein (PAT) 


n.d. 


n.d. 




49, 193 


SP1175 


conserved domain protein 


n.d. 


n.d.




50, 194 


SP1221


type II restriction endonuclease


n.d. 


n.d. 




51, 195 


SP1227 


DNA-binding response regulator


n.d. 


n.d. 




52, 196 


SP1241


amino acid ABC transporter, amino
acid-binding protein


50 


0/285 




53,197 


SP1287 


signal recognition particle protein 
(ffh)


49 


0/300 




54,198 


SP1330


N-acetylmannosamine-6-P
epimerase, putative (nanE)


14 


U/Z11» 




55,199


SP1374 


Chorismate synthetase (aroC)


50 


0/289 




56,200 


SP1378 


conserved hypothetical protein


n.d. 


n.d. 




57,201 


SP1429


peptidase, U32 family


50 


S/305 




58,202 


SP1478


oxidoreductase, aldo/keto reductase 


n.d. 


n.d. 




59,203




family










SP1518 


conserved hypothetical protein


en 


AM^- ^ additional 




60,204 


SP1522


conserved domain protein


n.d.


n.d. 




61,205 


SP1527


oligopeptide ABC transporter


DU 


0/463 




62,206


SP1573


lysozyme (lytC)


n.d.


n.d. 




63,207 


SP1604 


hypothetical protein 


cn 
DD 






64,208


SP1661 


cell division protein DivIVA 


50 








SP1664


ylmF protein (ylmF)


50


0/164 




66,210 


SP1676


N-acetylneuraminate lyase, putative


n.d.


n.d. 




67, 211 


SP1687


neuraminidase B (nanB) 


n.d. 


n.d. 




68,212 


SP1693


neuraminidase A (nanA)


n.d.


n.d.






SP1732 


serine/threonine protein kinase


49 


2/293 




70, 214 


SP1735 


methionyl-tRNA formyltransferase
(fmt)


n.d.


n.d.




71, 215


SP1759 


preprotein translocase, SecA subunit
(secA-2)


n.d.


n.d.




72, 216


SP1772 


cell wall surface anchor family
protein 


23 


12/253# 




73, 217


SP1804


general stress protein 24, putative 


n.d. 


n.d. 




74,218 


SP1888


oligopeptide ABC transporter, ATP- 

binding protein AmiE 


n.d.


n.d. 




75, 219 


SP1891


oligopeptide ABC transporter,


n.d. 


n.d. 




76,220 


SP1937


Autolysin (lytA)


50 


0/275 




77, 221 


SP1954


serine protease, subtilase family, 
authentic frame 


12 


0/305# 




78, 222 


SP1980


cmp-binding-factor 1 (cbf1)


n.d.


n.d. 






SP1992


cell wall surface anchor family 
protein 


50 


4/197 








catabolite control protein A (ccpA)


■n ri 
TLCL, 






81,225 


SP2021 


glycosyl hydrolase 


n.d.


n.d. 




82,226 


SP2027


Conserved hypothetical protein


n.d.


n.d.




83,227 


SP2039


conserved hypothetical protein 


n.d. 


n.d.




fid 


SP2048


Conserved hypothetical protein 


50 


8/134 




OR OOQ 




Competence protein CglC


OU 


ft/Q9 

0/7A 




86,230 


SP2092


UTP-glucose-l-phosphate 
uridylyltransferase (galU) 


n.d. 


n.d.




87,231 


SP2099


Penicillin binding protein 1B


n.d.


n.d.




88,232 


SP2108


Maltose ABC transporter


50 


V279 




89,233 


SP2120


hypothetical protein 


n.d. 


n.d. 




90,234


SP2128


transketolase, N-terminal subunit


n.d.


n.d.




91,235 


SP2136


choline binding protein PcpA


45 






92,236 


SP2141


glycosyl hydrolase-related protein




ILG. 




93,237 


SP2180 


conserved hypothetical protein


n.d. 


M A 
TLCL. 




94, 238 


SP2190


choline binding protein A (cbpA)


47 


foe 48^%; rev: 2 A7# 




95,239 


SP2194


ATP-dependent Clp protease, ATP-
binding subunit 


50 


1/262 




96,240 


SP2201


choline binding protein D (cbpD) 


50 


7/384 




97,241 


SP2204 


ribosomal protein L9 


n.d.


n.d. 




98,242 


SP2216


secreted 45 kd protein - homology to
glucan binding protein (GbpB)
S. mutans


50 


0/347 




99,243 


SP-NRF1


Choline binding protein 


n.d. 


ILd. 




XMUf ^Irx 














ARF0408 


Hypothetical protein 


n.d. 


_ A 

n.a. 




101, 245 


AKF0441 


Hypottietical protein 


n.d. 


ILd. 






AKF0690 


Hypothetical protein 


n.d. 


^ A 

ILO. 






AKF0878 


Hypothetical protein 


n.d. 


A 

ILCL 




104, 248 


ARF0921 


Hypothetical protein 


iLd. 


«« A 
ILCI. 




105, 249 


ASF1153 


Hypothetical protein 


n.d. 


•« A 
ILd. 




106 250 


AKF1515 


Hypothetical protein 


n.d. 


n.d. 




107, 251 


ARF1519 


HjrpotiKetical protein 


n.d. 


_ A 

ILO. 




ins 9S9 


ARF1905 


Hypothetical protein 


n.d. 


ILd. 




iriQ *>^v\ 


ARF2044 


Hypothetical protein 


ILd. 


ILd. 




IIU, 


ARF2155 


Hypothetical protein 


n.d. 


ILd. 




111, ZDO 


AICF2199 


Hypothetical protein 


xud. 


ILd. 




110 otu^ 


CRF0129 


Hypottietical protein 


rud. 


ILd. 






CRF0200 


Hypothetical protein 


n.d. 


ILd. 




■\-\A oeo 


CRF0236 


Hypothetical protein 


n.d. 


ILd. 




1-tt; ocQ 
±19, ZJOry 


CRF0394 


Hj^pothetical protein 


n.d. 


A 

ILOL 




116, 260 


CKF0408 


Hypothetical protein 


ILd. 


ILU. 




117, 261 


CRr0430 


Hypothetical protem 


ILd. 


A 

ILO. 




11ft 

XXO, ^flOfc 


CBF0498 


Hypodietical protein 


XL.Ua 


ILd. 




119,263 


CRF0519 


Hypothetical protein 




n.d. 




120,264 


CRF0573 


H3fpothetical protein 


ILd. 


n.d. 




121,265 


CRF0713 


Hypothetical protein 


ILd. 


ILd. 




122,266 


CRF0722 


Hypothetical protein 


* ILd. 


n.d. 




123,267 


CRF0764 jHypothetical protein 


ILd. 


ILd. 




124,268 
CRF1079 | Hypothetical protein | n.d. | n.d. | | 125, 269
CRF1248 | Hypothetical protein | n.d. | n.d. | | 126, 270
CRF1398 | Hypothetical protein | n.d. | n.d. | | 127, 271
CRF1412 | Hypothetical protein | n.d. | n.d. | | 128, 272
CRF1467.1 | Hypothetical protein | n.d. | n.d. | | 129, 273
CRF1484 | Hypothetical protein | n.d. | n.d. | | 130, 274
CRF1587 | Hypothetical protein | n.d. | n.d. | | 131, 275
CRF1606 | Hypothetical protein | n.d. | n.d. | | 132, 276
CRF1623 | Hypothetical protein | n.d. | n.d. | | 133, 277
CRF1625 | Hypothetical protein | n.d. | n.d. | | 134, 278
CRF1640 | Hypothetical protein | n.d. | n.d. | | 135, 279
CRF1702 | Hypothetical protein | n.d. | n.d. | | 136, 280
CRF1825 | Hypothetical protein | n.d. | n.d. | | 137, 281
CRF1883 | Hypothetical protein | n.d. | n.d. | | 138, 282
CRF1991 | Hypothetical protein | n.d. | n.d. | | 139, 283
CRF1992 | Hypothetical protein | n.d. | n.d. | | 140, 284
CRF2004 | Hypothetical protein | n.d. | n.d. | | 141, 285
CRF2030 | Hypothetical protein | n.d. | n.d. | | 142, 286
CRF2065 | Hypothetical protein | n.d. | n.d. | | 143, 287
CRF2232 | Hypothetical protein | n.d. | n.d. | | 144, 288

Table 4.

ORF | Common Name | FACS | PK
ARF0878 | hypothetical protein | + | nd
[missing] | hypothetical protein | + | nd
CRF0236 | hypothetical protein | ++ |
CRF0573 | hypothetical protein | + | nd
CRF1412 | hypothetical protein | | nd
CRF1702 | hypothetical protein | + | nd
CRF1992 | hypothetical protein | ++ | ++
SP0008 | hypothetical protein | + | [illegible]
SP0089 | Choline binding protein 1 | ++ | ++
SP0082 | Cell wall surface anchor | + | [illegible]
SP0117 | pneumococcal surface protein A (pspA) | +++ | +++
SP0212 | Ribosomal protein L2 | + | ++
SP0295 | ribosomal protein S9 (rpsI) | ++ | +++
SP0368 | cell wall surface anchor family protein | ++ | +++
SP0369 | Penicillin binding protein 1A | ++ | ++
SP0377 | Choline binding protein C | ++ |
SP0378 | choline binding protein J (cbpJ) | ++ | nd
SP0390 | choline binding protein G (cbpG) | ++ | +
SP0454 | hypothetical protein | ++ | +++
SP0463 | cell wall surface anchor family protein | | ++
SP0466 | sortase, putative | | ++
SP0468 | sortase, putative | ++ | ++
SP0519 | [illegible] | ++ | +
SP0609 | amino acid ABC transporter, amino acid-bind | ++ | +
SP0641 | Serine protease | |
SP0664 | Zinc metalloprotease ZmpB | + | ++
SP0749 | branched-chain amino acid ABC transporter | + | +
SP0770 | ABC transporter, ATP-binding protein | ++ | ++
SP1154 | IgA1 protease | ++ | ++
SP1287 | signal recognition particle protein (ffh) | + | ++
SP1330 | N-acetylmannosamine-6-P | ++ |
SP1429 | peptidase, U32 family | + | ++
SP1527 | oligopeptide ABC transporter | + |
SP1759 | preprotein translocase, SecA subunit (wrong clone!!!) | + |
SP1772 | cell wall surface anchor family protein | + | +
SP1891 | oligopeptide ABC transporter | + | ++
SP1937 | Autolysin (lytA) | + |
SP1954 | serine protease, subtilase family, auth frame | + | ++
SP1980 | cmp-binding-factor 1 (cbf1) | + |
SP2108 | Maltose ABC transporter | + | ++
SP2136 | choline binding protein PcpA | + | ++
SP2190 | choline binding protein A (cbpA) | + | ++
SP2194 | ATP-dependent Clp protease, ATP-bind subu | ++ | ++
SP2201 | choline binding protein D (cbpD) | + | ++
SP2216 | secreted 45 kd protein | + | ++

Claims:

1. An isolated nucleic acid molecule encoding a hyperimmune serum reactive antigen or a fragment
thereof comprising a nucleic acid sequence, which is selected from the group consisting of:

a) a nucleic acid molecule having at least 70% sequence identity to a nucleic acid molecule selected
from Seq ID No 1, 101-144,

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a),

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a)
or b),

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic
acid molecule of a), b), or c),

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the
nucleic acid molecule defined in a), b), c) or d).

2. The isolated nucleic acid molecule according to claim 1, wherein the sequence identity is at least
80%, preferably at least 95%, especially 100%.

3. An isolated nucleic acid molecule encoding a hyperimmune serum reactive antigen or a fragment
thereof comprising a nucleic acid sequence selected from the group consisting of

a) a nucleic acid molecule having at least 96% sequence identity to a nucleic acid molecule selected
from Seq ID No 2-6, 8, 10-16, 18-23, 25-31, 34, 36, 38-42, 44, 47-48, 51, 53, 55-62, 64, 67, 71-76, 78-
79, 81-94, 96-100,

b) a nucleic acid molecule which is complementary to the nucleic acid molecule of a),

c) a nucleic acid molecule comprising at least 15 sequential bases of the nucleic acid molecule of a)
or b),

d) a nucleic acid molecule which anneals under stringent hybridisation conditions to the nucleic
acid molecule of a), b) or c),

e) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the
nucleic acid defined in a), b), c) or d).

4. An isolated nucleic acid molecule comprising a nucleic acid sequence selected from the group
consisting of

a) a nucleic acid molecule selected from Seq ID No 9, 17, 24, 32, 37, 43, 52, 54, 65-66, 70, 80,

b) a nucleic acid molecule which is complementary to the nucleic acid of a),

c) a nucleic acid molecule which, but for the degeneracy of the genetic code, would hybridise to the
nucleic acid defined in a), b), c) or d).

5. The nucleic acid molecule according to any one of the claims 1, 2, 3 or 4, wherein the nucleic acid is
DNA.

6. The nucleic acid molecule according to any one of the claims 1, 2, 3, 4, or 5 wherein the nucleic acid
is RNA.

7. An isolated nucleic acid molecule according to any one of claims 1 to 5, wherein the nucleic acid
molecule is isolated from a genomic DNA, especially from a S. pneumoniae genomic DNA.

8. A vector comprising a nucleic acid molecule according to any one of claims 1 to 7.

9. A vector according to claim 8, wherein the vector is adapted for recombinant expression of the
hyperimmune serum reactive antigens or fragment thereof encoded by the nucleic acid molecule
according to any one of claims 1 to 7.

10. A host cell comprising the vector according to claim 8 or 9.

11. A hyperimmune serum-reactive antigen comprising an amino acid sequence being encoded by a
nucleic acid molecule according to any one of the claims 1, 2, 5, 6 or 7 and fragments thereof,
wherein the amino acid sequence is selected from the group consisting of Seq ID No 145, 245-288.

12. A hyperimmune serum-reactive antigen comprising an amino acid sequence being encoded by a
nucleic acid molecule according to any one of the claims 3, 5, 6, or 7 and fragments thereof,
wherein the amino acid sequence is selected from the group consisting of Seq ID No 146-150, 152,
154-160, 162-167, 169-175, 178, 180, 182-186, 188, 191-192, 195, 197, 199-206, 208, 211, 215-220, 222-
223, 225-238, 240-244.

13. A hyperimmune serum-reactive antigen comprising an amino acid sequence being encoded by a
nucleic acid molecule according to any one of the claims 4, 5, 6, or 7 and fragments thereof,
wherein the amino acid sequence is selected from the group consisting of Seq ID No 153, 161, 168,
176, 181, 187, 196, 198, 209-210, 214, 224.



14. Fragments of hyperimmune serum-reactive antigens selected from the group consisting of peptides
comprising amino acid sequences of column "predicted immunogenic aa" and "location of
identified immunogenic region" of Table 2; the serum reactive epitopes of Table 2, especially
peptides comprising amino acid 4-11, 35-64, 66-76, 101-108, 111-119 and 57-114 of Seq ID No 145;
5-27, 32-64, 92-102, 107-113, 119-125, 133-139, 148-162, 177-187, 195-201, 207-214, 241-251, 254-269, 
285-300, 302-309, 317-324, 332-357, 365-404, 411-425, 443-463, 470-477, 479-487, 506-512, 515-520, 
532-547, 556-596, 603-610, 616-622, 624-629, 636-642, 646-665, 667-674, 687-692, 708-720, 734-739, 
752-757, 798-820, 824-851, 856-865 and 732-763 of Seq ID No 146; 14-21, 36-44, 49-66, 102-127, 162- 
167, 177-196, 45-109 and 145-172 of Seq ID No 147; 17-35, 64-75, 81-92, 100-119, 125-172, 174-183, 
214-222, 230-236, 273-282, 287-303, 310-315, 331-340, 392-398, 412-420, 480-505, 515-523, 525-546, 
553-575, 592-598, 603-609, 617-625, 631-639, 644-651, 658-670, 681-687, 691-704, 709-716, 731-736, 
739-744, 750-763, 774-780, 784-791, 799-805, 809-822, 859-870, 880-885, 907-916, 924-941, 943-949,
973-986, 1010-1016, 1026-1036, 1045-1054, 1057-1062, 1082-1088, 1095-1102, 1109-1120, 1127-1134, 
1140-1146, 1152-1159, 1169-1179, 1187-1196, 1243-1251, 1262-1273, 1279-1292, 1306-1312, 1332-1343, 
1348-1364, 1379-1390, 1412-1420, 1427-1436, 1458-1468, 1483-1503, 1524-1549, 1574-1588, 1614-1619, 
1672-1685, 1697-1707, 1711-1720, 1738-1753, 1781-1787, 1796-1801, 1826-1843, 132-478, 508-592 and 
1753-1810 of Seq ID No 148; 15-43, 49-55, 71-77, 104-110, 123-130, 162-171, 180-192, 199-205, 219-
227, 246-254, 264-270, 279-287, 293-308, 312-322, 330-342, 349-356, 369-377, 384-394, 401-406, 416-422,
432-439, 450-460, 464-474, 482-494, 501-508, 521-529, 536-546, 553-558, 568-574, 584-591, 602-612, 
616-626, 634-646, 653-660, 673-681, 688-698, 705-710, 720-726, 736-749, 833-848, 1-199, 200-337, 418- 
494 and 549-647 of Seq ID No 149; 9-30, 65-96, 99-123, 170-178 and 1-128 of Seq ID No 150; 7-32, 
34-41, 96-106, 127-136, 154-163, 18&-199, 207-238, 272-279, 306-312, 318-325, 341-347, 353-360, 387-
393, 399-406, 434-440, 452-503, 575-580, 589-601, 615-620, 635^, 654-660, 674-680, 696-701, 710-731, 
1-548 and 660-691 of Seq ID No 151; 4-19, 35-44, 48-59, 77-87, 93-99, 106-111, 130-138, 146-161 and 
78-84 of Seq ID No 152; 24-30, 36-43, 64-86, 93-99, 106-130, 132-145, 148-165, 171-177, 189-220, 230- 
249, 251-263, 293-300, 302-312, 323-329, 338-356, 369-379, 390-412 and 179-193 of Seq ID No 153; 30-
39, 61-67, 74-81, 90-120, 123-145, 154-167, 169-179, 182-197, 200-206, 238-244, 267-272 and 230-265 of 
Seq ID No 154; 14-20, 49-65, 77-86 and 2-68 of Seq ID No 155; 4-9, 26-35, 42-48, 53-61, 63-85, 90-101, 
105-111, 113-121, 129-137, 140-150, 179-188, 199-226, 228-237, 248-255, 259-285, 299-308, 314-331, 
337-343, 353-364, 410-421, 436-442 and 110-144 of Seq ID No 156; 36-47, 55-63, 94-108, 129-134, 144-
158, 173-187, 196-206, 209-238, 251-266, 270-285, 290-295, 300-306, 333-344, 346-354, 366-397, 404-410,
422-435, 439-453, 466-473, 515-523, 529-543, 554-569, 571-585, 590-596, 607-618, 627-643, 690-696,
704-714, 720-728, 741-749, 752-767, 780-799, 225-247 and 480-507 of Seq ID No 157; 16-25, 36-70, 80-
93, 100-106 and 78-130 of Seq ID No 158; 18-27, 41-46, 50-57, 65-71, 79-85, 93-98, 113-128, 144-155,
166-178, 181-188, 201-207, 242-262, 265-273, 281-295, 303-309, 318-327 and 36-64 of Seq ID No 159; 
7-29, 31-44, 50-59, 91-96, 146-153, 194-201, 207-212, 232-238, 264-278, 284-290, 296-302, 326-353, 360-
370, 378-384, 400-405, 409-418, 420-435, 442-460, 499-506, 529-534, 556-562, 564-576, 644-651, 677-684,
687-698, 736-743, 759-766, 778-784, 808-814, 852-858, 874-896, 920-925, 929-935, 957-965, 1003-1012, 
1021-1027, 1030-1044, 1081-1087, 1101-1111, 1116-1124, 1148-1159, 1188-1196, 1235-1251, 1288-1303, 
1313-1319, 1328-1335, 1367-1373, 1431-1437, 1451-1458, 1479-1503, 1514-1521, 1530-1540, 1545-1552, 
1561-1568, 1598-1605, 1617-1647, 1658-1665, 1670-1676, 1679-1689, 1698-1704, 1707-1713, 1732-1738, 
1744-1764, 1-70, 154-189, 922-941, 1445-1462 and 1483-1496 of Seq ID No 160; 6-51, 81-91, 104-113, 
126-137, 150-159, 164-174, 197-209, 215-224, 229-235, 256-269, 276-282, 307-313, 317-348, 351-357, 
376-397, 418-437, 454-464, 485-490, 498-509, 547-555, 574-586, 602-619 and 452-530 of Seq ID No 161; 
25-31, 39-47, 49-56, 99-114, 121-127, 159-186, 228-240, 253-269, 271-279, 303-315, 365-382, 395-405, 
414-425, 438-453 and 289-384 of Seq ID No 162; 9-24, 41-47, 49-54, 68-78, 108-114, 117-122, 132-140, 
164-169, 179-186, 193-199, 206-213, 244-251, 267-274, 289-294, 309-314, 327-333, 209-249 and 286-336 
of Seq ID No 163; 9-28, 53-67, 69-82, 87-93, 109-117, 172-177, 201-207, 220-227, 242-247, 262-268, 305- 
318, 320-325 and 286-306 of Seq ID No 164; 4-10, 26-39, 47-58, 63-73, 86-96, 98-108, 115-123, 137-143, 
148-155, 160-176, 184-189, 194-204, 235-240, 254-259, 272-278 and 199-283 of Seq ID No 165; 4-26, 
33-39, 47-53, 59-65, 76-83, 91-97, 104-112, 118-137, 155-160, 167-174, 198-207, 242-268, 273-279, 292- 
315, 320-332, 345-354, 358-367, 377-394, 403-410, 424-439, 445-451, 453-497, 511-518, 535-570, 573-589, 
592-601, 604-610 and 202-242 of Seq ID No 166; 8-30, 36-45, 64-71, 76-82, 97-103, 105-112, 134-151, 
161-183, 211-234, 253-268, 270-276, 278-284, 297-305, 309-315, 357-362, 366-372, 375-384, 401-407, 
409-416, 441-455, 463-470, 475-480, 490-497, 501-513, 524-537, 552-559, 565-576, 581-590, 592-600, 
619-625, 636-644, 646-656 and 316-419 of Seq ID No 167; 4-17, 52-58, 84-99, 102-110, 114-120, 124- 
135, 143-158, 160-173, 177-196, 201-216, 223-250, 259-267, 269-275 and 1-67 of Seq ID No 168; 6-46, 
57-67, 69-80, 82-133, 137-143, 147-168, 182-187, 203-209, 214-229, 233-242, 246-280 and 53-93 of Seq 
ID No 169; 7-40, 50-56, 81-89, 117-123, 202-209, 213-218, 223-229, 248-261, 264-276, 281-288, 303-308, 
313-324, 326-332, 340-346, 353-372, 434-443, 465-474, 514-523, 556-564, 605-616, 620-626, 631-636, 
667-683, 685-699, 710-719, 726-732, 751-756, 760-771, 779-788, 815-828, 855-867, 869-879, 897-902, 
917-924, 926-931, 936-942, 981-1000, 1006-1015, 1017-1028, 1030-1039, 1046-1054, 1060-1066, 1083- 
1092, 1099-1112, 1122-1130, 1132-1140, 1148-1158, 1161-1171, 1174-1181, 1209-1230, 1236-1244, 1248- 
1254, 1256-1267, 1269-1276, 1294-1299, 1316-1328, 1332-1354, 1359-1372, 1374-1380, 1384-1390, 1395- 
1408, 1419-1425, 1434-1446, 1453-1460, 1465-1471, 1474-1493, 1505-1515, 1523-1537, 1547-1555, 1560- 
1567, 1577-1605, 1633-1651, 1226-1309, 1455-1536 and 1538-1605 of Seq ID No 170; 4-10, 31-39, 81- 
88, 106-112, 122-135, 152-158, 177-184, 191-197, 221-227, 230-246, 249-255, 303-311, 317-326, 337-344, 
346-362, 365-371, 430-437, 439-446, 453-462, 474-484 and 449-467 of Seq ID No 171; 9-15, 24-35, 47- 
55, 122-128, 160-177, 188-196, 202-208, 216-228, 250-261, 272-303, 318-324, 327-339, 346-352, 355-361, 
368-373, 108-218 and 344-376 of Seq ID No 172; 6-14, 17-48, 55-63, 71-90, 99-109, 116-124, 181-189, 
212-223, 232-268, 270-294, 297-304, 319-325, 340-348, 351-370, 372-378, 388-394, 406-415, 421-434 and 
177-277 of Seq ID No 173; 21-39, 42-61, 65-75, 79-85, 108-115 and 11^8 of Seq ID No 174; 4-17, 26- 
39, 61-76, 103-113, 115-122, 136-142, 158-192, 197-203, 208-214, 225-230, 237-251 and 207-225 of Seq 
ID No 175; 5-11, 27-36, 42-53, 62-70, 74-93, 95-104, 114-119, 127-150, 153-159, 173-179, 184-193, 199- 
206, 222-241, 248-253, 257-280, 289-295, 313-319, 322-342, 349-365, 368-389, 393-406, 408-413, 426-438, 
447-461, 463-470, 476-495, 532-537, 543-550 and 225-246 of Seq ID No 176; 4-29, 68-82, 123-130, 141- 
147, 149-157, 178-191, 203-215, 269-277, 300-307, 327-335, 359-370, 374-380, 382-388, 393-400, 410-417, 
434-442, 483-492, 497-503, 505-513, 533-540, 564-569, 601-607, 639-647, 655-666, 693-706, 712-718,
726-736, 752-758, 763-771, 774-780, 786-799, 806-812, 820-828, 852-863, 884-892, 901-909, 925-932, 
943-948, 990-996, 1030-1036, 1051-1059, 1062-1068, 1079-1086, 1105-1113, 1152-1162, 1168-1179, 1183- 
1191, 1204-1210, 1234-1244, 1286-1295, 1318-1326, 1396-1401, 1451-1460, 1465-1474, 1477-1483, 1488-
1494, 1505-1510, 1514-1521, 1552-1565, 1593-1614, 1664-1672, 1677-1685, 1701-1711, 1734-1745, 1758- 
1770, 1784-1798, 1840-1847, 1852-1873, 1885-1891, 1906-1911, 1931-1939, 1957-1970, 1977-1992, 2014- 
2020, 2026-2032, 2116-2134, 1-348, 373-490, 573-767, 903-1043, 1155-1198, 1243-148^ 1550-1595, 1682- 
1719, 1793-1921 and 2008-2110 of Seq ID No 177; 10-35, 39-52, 107-112, 181-188, 226-236, 238-253, 
258-268, 275-284, 296-310, 326-338, 345-368, 380-389, 391-408, 410-418, 420-429, 444-456, 489-505,
573-588, 616-623, 637-643, 726-739, 741-767, 785-791, 793-803, 830-847, 867-881, 886-922, 949-956, 
961-980, 988-1004, 1009-1018, 1027-1042, 1051-1069, 1076-1089, 1108-1115, 1123-1135, 1140-1151, 
1164-1179, 1182-1191, 1210-1221, 1223-1234, 1242-1250, 1255-1267, 1281-1292, 1301-1307, 1315-1340, 
1348-1355, 1366-1373, 1381-1413, 1417-1428, 1437-1444, 1453-1463, 1478-1484, 1490-1496, 1498-1503, 
1520-1536, 1538-1546, 1548-1570, 1593-1603, 1612-1625, 1635-1649, 1654-1660, 1670-1687, 1693-1700, 
1705-1711, 1718-1726, 1729-1763, 1790-1813, 1871-1881, 1893-1900, 1907-1935, 1962-1970, 1992-2000, 
2006-2013, 2033-2039, 2045-2051, 2055-2067, 2070-2095, 2097-2110, 2115-2121, 2150-2171, 2174-2180, 
2197-2202, 2206-2228 and 1526-1560 of Seq ID No 178; 4-17, 35-48, 54-76, 78-107, 109-115, 118-127, 
134-140, 145-156, 169-174, 217-226, 232-240, 256-262, 267-273, 316-328, 340-346, 353-360, 402-409, 
416-439, 448-456, 506-531, 540-546, 570-578, 586-593, 595-600, 623-632, 662-667, 674-681, 689-705, 
713-724, 730-740, 757-763, 773-778, 783-796, 829-835, 861-871, 888-899, 907-939, 941-955, 957-969, 
986-1000, 1022-1028, 1036-1044, 1068-1084, 1095-1102, 1118-1124, 1140-1146, 1148-1154, 1168-1181, 
1185-1190, 1197-1207, 1218-1226, 1250-1270, 1272-1281, 1284-1296, 1312-1319, 1351-1358, 1383-1409, 
1422-1428, 1438-1447, 1449-1461, 1482-1489, 1504-1510, 1518-1527, 1529-1537, 1544-1551, 1569-1575, 
1622-1628, 1631-1637, 1682-1689, 1711-1718, 1733-1740, 1772-1783, 1818-1834, 1859-1872, 1-64 and 
128-495 of Seq ID No 179; 8-28, 32-37, 62-69, 119-125, 137-149, 159-164, 173-189, 200-205, 221-229, 
240-245, 258-265, 268-276, 287-293, 296-302, 323^29 and 1-95 of Seq ID No 180; 9-18, 25-38, 49-63, 
65-72, 74-81, 94-117, 131-137, 139-146, 149-158, 162-188, 191-207, 217-225, 237-252, 255-269, 281-293, 
301-326, 332-342, 347-354, 363-370, 373-380, 391-400, 415-424, 441-447 and 75-107 of Seq ID No 181; 
4-24, 64-71, 81-87, 96-116, 121-128, 130-139, 148-155, 166-173, 176-184, 203-215, 231-238, 243-248, 256-
261, 280-286, 288-306, 314^29 and 67-148 of Seq ID No 182; 4-10, 19-37, 46-52, 62-81, 83-89, 115-120, 
134-139, 141-151, 168-186, 197-205, 209-234, 241-252, 322-335, 339-345, 363-379, 385-393, 403-431, 
434-442, 447-454, 459-465, 479-484, 487-496 and 404-420 of Seq ID No 183; 10-35, 46-66, 71-77, 84-93, 
96-122, 138-148, 154-172, 182-213, 221-233, 245-263, 269-275, 295-301, 303-309, 311-320, 324-336, 340- 
348, 351-359, 375-381 and 111-198 of Seq ID No 184; 14-25, 30-42, 47-61, 67-75, 81-91, 98-106, 114-
122, 124-135, 148-193, 209-227 and 198-213 of Seq ID No 185; 5-18, 45-50, 82-90, 97-114, 116-136,
153-161, 163-171, 212-219, 221-227, 240-249, 267-281, 311-317, 328-337, 375-381, 390-395, 430-436, 
449-455, 484-495, 538-543, 548-554, 556-564, 580-586, 596-602 and 493-606 of Seq ID No 186; 9-25, 
28-34, 37-44, 61-68, 75-81, 88-96, 98-111, 119-133, 138-150, 152-163, 168-182, 186-194, 200-205, 216- 
223, 236-245, 257-264, 279-287, 293-304, 311-318, 325-330, 340-346, 353-358, 365-379, 399-409, 444-453 
and 303-391 of Seq ID No 187; 16-36, 55-61, 66-76, 78-102, 121-130, 134-146, 150-212, 221-239, 255- 
276, 289-322, 329-357 and 29-59 of Seq ID No 188; 8-27, 68-74, 77-99, 110-116, 124-141, 171-177, 202- 
217, 221-228, 259-265, 275-290, 293-303, 309-325, 335-343, 345-351, 365-379, 384-394, 406-414, 423-437, 
452-465, 478-507, 525-534, 554-560, 611-624, 628-651, 669-682, 742-747, 767-778, 782-792, 804-812, 
820-836, 79-231 and 359-451 of Seq ID No 189; 5-28, 39-45, 56-62, 67-74, 77-99, 110-117, 124-141, 168- 
176, 200-230, 237-244, 268-279, 287-299, 304-326, 329-335, 348-362, 370-376, 379-384, 390-406, 420-429, 
466-471, 479-489, 495-504, 529-541, 545-553, 561-577, 598-60^ 622-630, 637-658, 672-680, 682-688, 
690-696, 698-709, 712-719, 724-736, 738-746, 759-769, 780-786, 796-804, 813-818, 860-877, 895-904, 
981-997, 1000-1014, 1021-1029, 1-162, 206-224, 254-350, 414-514 and 864-938 of Seq ID No 190; 4-11, 
19-49, 56-66, 68-101, 109-116, 123-145, 156-165, 177-185, 204-221, 226-234, 242-248, 251-256, 259-265, 
282-302, 307-330, 340-349, 355-374, 377-383, 392-400, 422-428, 434-442, 462-474 and 266-322 of Seq
ID No 191; 14-43, 45-57, 64-74, 80-87, 106-127, 131-142, 145-161, 173-180, 182-188, 203-210, 213-219, 
221-243, 245-254, 304-311, 314-320, 342-348, 354-365, 372-378, 394-399, 407-431, 436-448, 459-465, 
470-477, 484-490, 504-509, 531-537, 590-596, 611-617, 642-647, 723-734, 740-751, 754-762, 764-774,
782-797, 807-812, 824-831, 838-845, 877-885, 892-898, 900-906, 924-935, 940-946, 982-996, 1006-1016, 
1033-1043, 1051-1056, 1058-1066, 1094-1108, 1119-1126, 1129-1140, 1150-1157, 1167-1174, 1176-1185, 
1188-1201, 1209-1216, 1220-1228, 1231-1237, 1243-1248, 1253-1285, 1288-1297, 1299-1307, 1316-1334, 
1336-1343, 1350-1359, 1365-1381, 1390-1396, 1412-1420, 1427-1439, 1452-1459, 1477-1484, 1493-1512, 
1554-1559, 1570-1578, 1603-1608, 1623-1630, 1654-1659, 1672-1680, 1689-1696, 1705-1711, 1721-1738, 
1752-1757, 1773-1780, 1817-1829, 1844-1851, 1856-1863, 1883-1895, 1950-1958, 1974-1990, 172-354,
384-448, 464-644, 648-728 and 1357-1370 of Seq ID No 192; 8-27, 68-74, 77-99, 110-116, 124-141, 169- 
176, 201-216, 220-227, 258-264, 274-289, 292-302, 308-324, 334-342, 344-350, 364-372, 377-387, 399-407,
416-429, 445-458, 471-481, 483-500, 518-527, 547-553, 604-617, 621-644, 662-675, 767-77%, 809-816, 15-
307, 350-448 and 496-620 of Seq ID No 193; 4-17, 24-29, 53-59, 62-84, 109-126, 159-164, 189-204, 208- 
219, 244-249, 274-290, 292-302, 308-324, 334-342, 344-350, 378-389, 391-397, 401-409, 424-432, 447-460, 
470-479, 490-504, 521-529, 538-544, 549-555, 570-577, 583-592, 602-608, 615-630, 635-647, 664-677, 
692-698, 722-731, 733-751, 782-790, 793-799, 56-267, 337-426 and 495-601 of Seq ID No 194; 12-22, 
49-59, 77-89, 111-121, 136-148, 177-186, 207-213, 217-225, 227-253, 259-274, 296-302, 328-333, 343-354, 
374-383, 424-446, 448-457, 468-480, 488-502, 507-522, 544-550, 553-560, 564-572, 587-596, 604-614,
619-625, 629-635, 638-656, 662-676, 680-692, 697-713, 720-738, 779-786, 833-847, 861-869, 880-895, 
897-902, 911-917, 946-951, 959-967, 984-990, 992-1004, 1021-1040, 1057-1067, 1073-1080 and 381^03 
of Seq ID No 195; 4-10, 26-31, 46-56, 60-66, 70-79, 86-94, 96-102, 109-118, 132-152, 164-187, 193-206, 
217-224 and 81-149 of Seq ID No 196; 4-21, 26-37, 48-60, 71-82, 109-117, 120-128, 130-136, 142-147, 
181-187, 205-211, 216-223, 247-255, 257-284, 316-325, 373-379, 395-400, 423-435, 448-456, 479-489,
512-576, 596-625, 641-678, 680-688, 692-715 and 346-453 of Seq ID No 197; 10-16, 25-31, 34-56, 58-69,
71-89, 94-110, 133-176, 186-193, 208-225, 240-250, 259-266, 302-307, 335-341, 376-383, 410-416 and 
316-407 of Seq ID No 198; 11-29, 42-56, 60-75, 82-88, 95-110, 116-126, 132-143, 145-160, 166-172, 184- 
216 and 123-164 of Seq ID No 199; 11-29, 54-63, 110-117, 139-152, 158-166, 172-180, 186-193, 215-236, 
240-251, 302-323, 330-335, 340-347, 350-366, 374-381 and 252-299 of Seq ID No 200; 18-27, 35-42, 50-
56, 67-74, 112-136, 141-153, 163-171, 176-189, 205-213, 225-234, 241-247, 253-258, 269-281, 288-298, 
306-324, 326-334, 355-369, 380-387 and 289-320 of Seq ID No 201; 7-15, 19-41, 56-72, 91-112, 114-122, 
139-147, 163-183, 196-209, 258-280, 326-338, 357-363, 391-403, 406-416 and 360-378 of Seq ID No 202; 
11-18, 29-41, 43-49, 95-108, 142-194, 204-212, 216-242, 247-256, 264-273 and 136-149 of Seq ID No 
203; 18-24, 33-40, 65-79, 89-102, 113-119, 130-137, 155-161, 173-179, 183-203, 205-219, 223-231, 245- 
261, 267-274, 296-306, 311-321, 330-341, 344-363, 369-381, 401-408, 415-427, 437-444, 453-464, 472-478,
484-508, 517-524, 526-532, 543-548 and 59-180 of Seq ID No 204; 5-13, 52-65, 67-73, 97-110, 112-119, 
134-155 and 45-177 of Seq ID No 205; 6-28, 34-43, 57-67, 75^1, 111-128, 132-147, 155-163, 165-176, 
184-194, 208-216, 218-229, 239-252, 271-278, 328-334, 363-376, 381-388, 426-473, 481-488, 492-498, 
507-513, 536-546, 564-582, 590-601, 607-623, 148-269, 420-450 and 610-648 of Seq ID No 206; 4-12, 
20-38, 69-75, 83-88, 123-128, 145-152, 154-161, 183-188, 200-213, 245-250, 266-272, 306-312, 332-339, 
357-369, 383-389, 395-402, 437-453, 455-470, ^7-503 and 1-112 of Seq ID No 207; 35-59, 74-86, 111- 
117, 122-137 and 70-154 of Seq ID No 208; 26-42, 54-61, 65-75, 101-107, 123-130, 137-144, 148-156, 
164-172, 177-192, 213-221, 231-258 and 157-249 of Seq ID No 209; 29^8, 61-67, 77-87, 94-100, 105-
111, 118-158 and 1-97 of Seq ID No 210; 7-21, 30-48, 51-58, 60-85, 94-123, 134-156, 160-167, 169-183, 
186-191, 216-229, 237-251, 257-267, 272-282, 287-298 and 220-243 of Seq ID No 211; 6-29, 34-47, 56- 
65, 69-76, 83-90, 123-134, 143-151, 158-178, 197-203, 217-235, 243-263, 303-309, 320-333, 338-348, 367-
373, 387-393, 407-414, 416-427, 441-457, 473-482, 487-499, 501-509, 514-520, 530-535, 577-583, 590-602, 
605-612, 622-629, 641-670, 678-690, 37-71 and 238-307 of Seq ID No 212; 7-40, 121-132, 148-161, 196- 
202, 209-215, 221-235, 248-255, 271-280, 288-295, 330-339, 395-409, 414-420, 446-451, 475-487, 556-563, 
568-575, 580-586, 588-595, 633-638, 643-648, 652-659, 672-685, 695-700, 710-716, 737-742, 749-754, 
761-767, 775-781, 796-806, 823-835, 850-863, 884-890, 892-900, 902-915, 934-941 and 406-521 of Seq 
ID No 213; 9-18, 24-46, 51-58, 67-77, 85-108, 114-126, 129-137, 139-146, 152-165, 173-182, 188-195, 
197-204, 217-250, 260-274, 296-313, 343-366, 368-384, 427-434, 437-446, 449-455, 478-484, 492-506,
522-527, 562-591, 599-606, 609-618, 625-631, 645-652 and 577-654 of Seq ID No 214; 13-20, 26-37, 41- 
53, 56-65, 81-100, 102-114, 118-127, 163-188, 196-202, 231-238, 245-252, 266-285, 293-298, 301-306 and 
19-78 of Seq ID No 215; 10-23, 32-42, 54-66, 73-91, 106-113, 118-127, 139-152, 164-173, 198-207, 210-
245, 284-300, 313-318, 330-337, 339-346, 354-361, 387-393, 404-426, 429-439, 441-453, 467-473, 479-485, 
496-509, 536-544, 551-558, 560-566, 569-574, 578-588, 610-615, 627-635, 649-675, 679-690, 698-716, 
722-734, 743-75^ 769-780, 782-787 and 480-550 of Seq ID No 216; 6-39, 42-50, 60-68, 76-83, 114-129, 
147-162, 170-189, 197-205, 217-231, 239-248, 299-305, 338-344, 352-357, 371-377, 380-451, 459-483, 
491-499, 507-523, 537-559, 587-613, 625-681, 689-729, 737-781, 785-809, 817-865, 873-881, 889-939, 
951-975, 983-1027, 1031-1055, 1063-1071, 1079-1099, 1103-1127, 1151-1185, 1197-1261, 1269-1309, 
1317-1333, 1341-1349, 1357-1465, 1469-1513, 1517-1553, 1557-1629, 1637-1669, 1677-1701, 1709-1725, 
1733-1795, 1823-1849, 1861-1925, 1933-1973, 1981-2025, 2029-2053, 2061-2109, 2117-2125, 2133-2183,
2195-2219, 2227-2271, 2275-2299, 2307-2315, 2323-2343, 2347-2371, 2395-2429, 2441-2529, 2537-2569, 
2577-2601, 2609-2625, 2633-2695, 2699-2737, 2765-2791, 2803-2867, 2889-2913, 2921-2937, 2945-2969, 
2977-2985, 299S-3009, 3023-3045, 3073-3099, 3111-3167, 3175-3215, 3223-3267, 3271-3295, 3303-3351, 
3359-3367, 3375-3425, 3437-3461, 3469-3513, 3517-3541, 3549-3557, 3565-3585, 3589-3613, 3637-3671, 
3683-3747, 3755-3795, 3803-3819, 3827-3835, 3843-3951, 3955-3999, 4003-4039, 4043-4115, 4123-4143, 
4147-4171, 4195-4229, 4241-4305, 4313-4353, 4361-4377, 4385-4393, 4401-4509, 4513-4557, 4561-4597, 
4601-4718, 4749-4768, 74-171, 452^9 and 2951-3061 of Seq ID No 217; 16-22, 30-51, 70-111, 117-130, 
137-150, 171-178, 180-188, 191-196 and 148-181 of Seq ID No 218; 6-19, 21-46, 50-56, 80-86, 118-126, 
167-186, 189-205, 211-242, 244-267, 273-286, 290-297, 307-316, 320-341 and 34-60 of Seq ID No 219; 
5-26, 33-43, 48-54, 58-63, 78-83, 113-120, 122-128, 143-152, 157-175, 185-192, 211-225, 227-234, 244- 
256, 270-281, 284-290, 304-310, 330-337, 348-355, 362-379, 384-394, 429-445, 450-474, 483-490, 511-520, 
537-546, 548-554, 561-586, 590-604, 613-629, 149-186, 285-431 and 573^9 of Seq ID No 220; 5-26, 
49-59, 61-67, 83-91, 102-111, 145-157, 185-192, 267-272, 279-286, 292-298, 306-312, 134-220, 235-251 
and 254-280 of Seq ID No 221; 5-19, 72-79, 83-92, 119-124, 140-145, 160-165, 167-182, 224-232, 240- 
252, 259-270, 301-310, 313-322, 332-343, 347-367, 384-398, 416-429, 431-446, 454-461 and 1-169 of Seq 
ID No 222; 8-17, 26-31, 56-62, 75-83, 93-103, 125-131, 135-141, 150-19^ 205-217, 233-258, 262-268, 
281-286 and 127-168 of Seq ID No 223; 6-12, 69-75, 108-115, 139-159, 176-182, 194-214 and 46-161 of 
Seq ID No 224; 6-13, 18-27, 39-48, 51-59, 66-73, 79-85, 95-101, 109-116, 118-124, 144-164, 166-177, 
183-193, 197-204, 215-223, 227-236, 242-249, 252-259, 261-270, 289-301, 318-325 and 12-58 of Seq ID 
No 225; 4-10, 26-32, 48-60, 97-105, 117-132, 138-163, 169-185, 192-214, 219-231, 249-261, 264-270, 292- 
308, 343-356, 385-392, 398-404, 408-417, 435-441 and 24-50 of Seq ID No 226; 10-40, 42-48, 51-61, 
119-126 and 1-118 of Seq ID No 227; 5-17, 40-58, 71-83, 103-111, 123-140, 167-177, 188-204 and 116- 
128 of Seq ID No 228; 4-9, 11-50, 57-70, 112-123, 127-138 and 64-107 of Seq ID No 229; 9-39, 51-67 
and 1-101 of Seq ID No 230; 5-14, 17-25, 28-46, 52-59, 85-93, 99-104, 111-120, 122-131, 140-148, 158- 
179, 187-197, 204-225, 271-283, 285-293 and 139-155 of Seq ID No 231; 42-70, 73-90, 92-108, 112-127, 
152-164, 166-172, 181-199, 201-210, 219-228, 247-274, 295-302, 322-334, 336-346, 353-358, 396-414, 
419-425, 432-438, 462-471, 518-523, 531-536, 561-567, 576-589, 594-612, 620-631, 665-671, 697-710, 
718-731, 736-756, 765-771, 784-801 and 626-653 of Seq ID No 232; 8-28, 41-51, 53-62, 68-74, 79-85, 94- 
100, 102-108, 114-120, 130-154, 156-162, 175-180, 198-204, 206-213, 281-294, 308-318, 321-339, 362-368,
381-386, 393-399, 407-415 and 2-13 of Seq ID No 233; 4-39, 48-65, 93-98, 106-112, 116-129 and 10-36 
of Seq ID No 234; 25-32, 35-50, 66-71, 75-86, 90-96, 123-136, 141-151, 160-179, 190-196, 209-215, 222- 
228, 235-242, 257-263, 270-280 and 209-247 of Seq ID No 235; 5-29, 31-38, 50-57, 62-75, 83-110, 115- 
132, 168-195, 197-206, 216-242, 249-258, 262-269, 333-340, 342-350, 363-368, 376-392, 400-406, 410-421, 
423-430, 436-442, 448-454, 460-466, 471-476, 491-496, 511-516, 531-536, 551-556, 571-576, 585-591, 
599-605, 27-70, 219-293, 441-504 and 512-584 of Seq ID No 236; 4-12, 14-34, 47-75, 83-104, 107-115, 
133-140, 148-185, 187-196, 207-212, 224-256, 258-265, 281-287, 289-296, 298-308, 325-333, 345-355, 
365-371, 382-395, 424-435, 441-457, 465-472, 483-491, 493-505, 528-534, 536-546, 552-558, 575-584, 
589-600, 616-623 and 576-591 of Seq ID No 237; 4-76, 78-89, 91-126, 142-148, 151-191, 195-208, 
211-223, 226-240, 256-277, 279-285, 290-314, 317-323, 358-377, 381-387, 391-396, 398-411, 415- 
434, 436-446, 454-484, 494-512, 516-523, 538-552, 559-566, 571-577, 579-596, 599-615, 620-627, 
635-644, 694-707, 720-734, 737-759, 761-771 and 313-329 of Seq ID No 238; 7^8, 44-49, 79-89, 99- 
108, 117-123, 125-132, 137-146, 178-187, 207-237, 245-255, 322-337, 365-387, 398-408, 445-462, 603-608, 
623-628, 644-650, 657-671, 673-679 and 111-566 of Seq ID No 239; 6-20, 22-35, 39-45, 58-64, 77-117, 
137-144, 158-163, 205-210, 218-224, 229-236, 239-251, 263-277, 299-307, 323-334, 353-384, 388-396, 
399-438, 443-448, 458-463, 467-478, 481-495, 503-509, 511-526, 559-576, 595-600, 612-645, 711-721, 
723-738, 744-758, 778-807 and 686-720 of Seq ID No 240; 10-33, 35-41, 72-84, 129-138, 158-163, 203- 
226, 243-252, 258-264, 279-302, 322-329, 381-386, 401-406, 414-435 and 184-385 of Seq ID No 241; 4-9, 
19-24, 41-47, 75-85, 105-110, 113-146 and 45-62 of Seq ID No 242; 4-25, 52-67, 117-124, 131-146, 173- 
180, 182-191, 195-206, 215-221, 229-236, 245-252, 258-279, 286-291, 293-302, 314-320, 327-336, 341-353, 
355-361, 383-389 and 1-285 of Seq ID No 243; 14-32, 38-50, 73-84, 93-105, 109-114 and 40-70 of Seq 
ID No 244; 5-26 and 22-34 of Seq ID No 245; 23-28 and 13-39 of Seq ID No 246; 8-14 and 21-34 of 
Seq ID No 247; 4-13, 20-29, 44-50, 59-74 and 41-69 of Seq ID No 248; 4-9, 19-42, 48-59, 71-83 and 
57-91 of Seq ID No 249; 4-14 and 10-28 of Seq ID No 250; 22-28, 32-42, 63-71, 81-111, 149-156, 158- 
167, 172-180, 182-203, 219-229 and 27-49 of Seq ID No 251; 17-27 and 23-32 of Seq ID No 252; 18-24 
and 28-38 of Seq ID No 253; 9-15 and 13-27 of Seq ID No 254; 13-22 and 18-29 of Seq ID No 255; 
17-26 and 2-11 of Seq ID No 256; 4-33 and 16-32 of Seq ID No 257; 4-10, 37-43, 54-84, 92-127 and 
15-62 of Seq ID No 258; 4-14, 20-32, 35-60, 69-75, 79-99, 101-109, 116-140 and 124-136 of Seq ID No 
259; 2-13 of Seq ID No 260; 4-13, 28-42 and 42-57 of Seq ID No 261; 4-14, 27-44 and 14-35 of Seq ID 
No 262; 4-12 and 1-27 of Seq ID No 263; 4-18, 39-45, 47-74 and 35-66 of Seq ID No 264; 8-20, 43-77 
and 17-36 of Seq ID No 265; 4-30, 35-45, 51-57 and 35-49 of Seq ID No 266; 4-24, 49-57 and 15-34 of 
Seq ID No 267; 4-22 and 8-27 of Seq ID No 268; 13-25, 32-59, 66-80 and 21-55 of Seq ID No 269; 4- 
10, 24-33, 35-42, 54-65, 72-82, 98-108 and 15-30 of Seq ID No 270; 8-19 and 17-47 of Seq ID No 271; 
12-18, 40-46 and 31-52 of Seq ID No 272; 4-20, 35-78, 83-102, 109-122 and 74-86 of Seq ID No 273; 7- 
17, 21-41, 46-63 and 2-20 of Seq ID No 274; 30-37 and 2-33 of Seq ID No 275; 4-13, 17-25 and 1-15 of 
Seq ID No 276; 17-31, 44-51 and 20-51 of Seq ID No 277; 20-30 and 5-23 of Seq ID No 278; 13-33, 
48-71 and 92-110 of Seq ID No 279; 4-9, 50-69, 76-88, 96-106, 113-118 and 12-34 of Seq ID No 280; 4- 
24 and 6-26 of Seq ID No 281; 7-26 and 14-30 of Seq ID No 282; 9-39, 46-68, 75-82, 84-103 and 26-44 
of Seq ID No 283; 4-30, 33-107 and 58-84 of Seq ID No 284; 4-12 and 9-51 of Seq ID No 285; 12-18, 
29-37 and 6-37 of Seq ID No 286; 4-21, 33-52, 64-71 and 16-37 of Seq ID No 287; 9-19 and 2-30 of 
Seq ID No 288; 20-37 of Seq ID No 245; 8-27 of Seq ID No 246; 10-27 of Seq ID No 247; 42-59 and 
52-69 of Seq ID No 248; 63-80 and 74-91 of Seq ID No 249; 11-28 of Seq ID No 250; 28-49 of Seq ID 
No 251; 15-32 of Seq ID No 252; 4-20 of Seq ID No 253; 10-27 of Seq ID No 254; 17-34 of Seq ID 
No 255; 1-18 of Seq ID No 256; 16-33 of Seq ID No 257; 16-36, 30-49 and 43-62 of Seq ID No 258; 
122-139 of Seq ID No 259; 1-18 of Seq ID No 260; 41-58 of Seq ID No 261; 15-35 of Seq ID No 262; 
2-27 of Seq ID No 263; 18-36 of Seq ID No 265; 34-51 of Seq ID No 266; 9-27 of Seq ID No 268; 22- 
47 of Seq ID No 269; 18-36 and 29-47 of Seq ID No 271; 32-52 of Seq ID No 272; 72-89 of Seq ID 
No 273; 3-20 of Seq ID No 274; 3-21 and 15-33 of Seq ID No 275; 1-18 of Seq ID No 276; 6-23 of 
Seq ID No 278; 93-110 of Seq ID No 279; 13-34 of Seq ID No 280; 7-26 and 9-26 of Seq ID No 281; 
16-33 of Seq ID No 282; 27-44 of Seq ID No 283; 67-84 of Seq ID No 284; 10-33 and 26-50 of Seq ID 
No 285; 7-25 and 19-37 of Seq ID No 286; 17-37 of Seq ID No 287; 3-20 and 13-30 of Seq ID No 288; 
62-80 and 75-93 of Seq ID No 145; 92-108 of Seq ID No 147; 332-349, 177-200 and 1755-1777 of Seq 
ID No 148; 109-133, 149-174, 260-285 and 460-485 of Seq ID No 149; 26-47 and 42-64 of Seq ID No 
150; 22-41, 35-54, 115-130, 306-325, 401-420 and 454-478 of Seq ID No 151; 22-45 of Seq ID No 155; 
156-174, 924-940, 1485-1496, 1447-1462 and 1483-1498 of Seq ID No 160; 457-475 of Seq ID No 161; 
302-325 of Seq ID No 163; 288-305 of Seq ID No 164; 244-266 and 260-282 of Seq ID No 165; 204- 
225 and 220-241 of Seq ID No 166; 324-345, 340-361, 356-377, 372-393 and 388-408 of Seq ID No 
167; 39-64 of Seq ID No 168; 54-76 and 70-92 of Seq ID No 169; 1227-1247, 1539-1559, 1554-1574, 
1569-1589, 1584-1604, 1242-1262, 1272-1292, 1287-1308, 1456-1477, 1472-1494, 1488-1510 and 1505- 
1526 of Seq ID No 170; 351-368 of Seq ID No 172; 179-200, 195-216, 211-232, 227-248 and 243-263 of 
Seq ID No 173; 13-37 of Seq ID No 174; 208-224 of Seq ID No 175; 42-64, 59-81, 304-328, 323-348, 
465-489, 968-992, 1399-1418, 1412-1431 and 2092-2111 of Seq ID No 177; 1528-1547 and 1541-1560 of 
Seq ID No 178; 184-200, 367-388, 382-403, 409-429, 425-444 and 438-457 of Seq ID No 179; 27-50 
and 45-67 of Seq ID No 180; 114-131 and 405-419 of Seq ID No 183; 113-134, 129-150, 145-166, 161- 
182 and 177-198 of Seq ID No 184; 495-515 of Seq ID No 186; 346-358 of Seq ID No 187; 208-224 of 
Seq ID No 190; 178-194, 202-223, 217-238, 288-308 and 1355-1372 of Seq ID No 192; 57-78 of Seq 
ID No 194; 347-369, 364-386, 381-403, 398-420, 415-437 and 432-452 of Seq ID No 197; 347-372 of 
Seq ID No 198; 147-163 of Seq ID No 199; 263-288 of Seq ID No 200; 361-377 of Seq ID No 202; 82- 
104, 99-121, 116-138, 133-155 and 150-171 of Seq ID No 204; 110-130 and 125-145 of Seq ID No 205; 
613-631, 626-644 and 196-213 of Seq ID No 206; 78-100, 95-117, 112-134 and 129-151 of Seq ID No 
208; 158-180, 175-197, 192-214, 209-231 and 226-248 of Seq ID No 209; 30-50, 45-65 and 60-79 of Seq 
ID No 210; 431-455 and 450-474 of Seq ID No 213; 579-601, 596-618, 613-635 and 630-653 of Seq ID 
No 214; 920-927, 98-119, 114-135, 130-151, 146-167 and 162-182 of Seq ID No 217; 36-59 of Seq ID 
No 219; 194-216 and 381-404 of Seq ID No 220; 236-251 and 255-279 of Seq ID No 221; 80-100 and 
141-164 of Seq ID No 222; 128-154 of Seq ID No 223; 82-100, 95-116 and 111-134 of Seq ID No 224; 
55-76, 71-92 and 87-110 of Seq ID No 227; 91-106 of Seq ID No 229; 74-96 of Seq ID No 230; 140- 
157 of Seq ID No 231; 4-13 of Seq ID No 233; 41-65 and 499-523 of Seq ID No 236; 122-146, 191- 
215, 288-313, 445-469 and 511-535 of Seq ID No 239; 347-368 of Seq ID No 241; 46-61 of Seq ID No 
242; 15-37, 32-57, 101-121, 115-135, 138-158, 152-172, 220-242 and 236-258 of Seq ID No 243. 

15. A process for producing a S. pneumoniae hyperimmune serum reactive antigen or a fragment 
thereof according to any one of the claims 11 to 14 comprising expressing the nucleic acid molecule 
according to any one of claims 1 to 7. 

16. A process for producing a cell, which expresses a S. pneumoniae hyperimmune serum reactive 
antigen or a fragment thereof according to any one of the claims 11 to 14 comprising transforming 
or transfecting a suitable host cell with the vector according to claim 8 or claim 9. 

17. A pharmaceutical composition, especially a vaccine, comprising a hyperimmune serum-reactive 
antigen or a fragment thereof, as defined in any one of claims 11 to 14 or a nucleic acid molecule 
according to any one of claims 1 to 7. 

18. A pharmaceutical composition, especially a vaccine, according to claim 17, characterized in that it 
further comprises an immunostimulatory substance, preferably selected from the group 
comprising polycationic polymers, especially polycationic peptides, immunostimulatory 
deoxynucleotides (ODNs), peptides containing at least two LysLeuLys motifs, neuroactive 
compounds, especially human growth hormone, alum, Freund's complete or incomplete 
adjuvants or combinations thereof. 

19. Use of a nucleic acid molecule according to any one of claims 1 to 7 or a hyperimmune serum- 
reactive antigen or fragment thereof according to any one of claims 11 to 14 for the manufacture of 
a pharmaceutical preparation, especially for the manufacture of a vaccine against S. pneumoniae 
infection. 

20. An antibody, or at least an effective part thereof, which binds at least to a selective part of the 
hyperimmune serum-reactive antigen or a fragment thereof according to any one of claims 11 to 
14. 

21. An antibody according to claim 20, wherein the antibody is a monoclonal antibody. 

22. An antibody according to claim 20 or 21, wherein said effective part comprises Fab fragments. 

23. An antibody according to any one of claims 20 to 22, wherein the antibody is a chimeric antibody. 

24. An antibody according to any one of claims 20 to 23, wherein the antibody is a humanized 
antibody. 

25. A hybridoma cell line, which produces an antibody according to any one of claims 20 to 24. 

26. A method for producing an antibody according to claim 20, characterized by the following steps: 

• initiating an immune response in a non-human animal by administering a hyperimmune 
serum-reactive antigen or a fragment thereof as defined in any one of the claims 11 to 14, to 
said animal, 

• removing an antibody-containing body fluid from said animal, and 

• producing the antibody by subjecting said antibody-containing body fluid to further 
purification steps. 

27. Method for producing an antibody according to claim 21, characterized by the following steps: 

• initiating an immune response in a non-human animal by administering a hyperimmune 
serum-reactive antigen or a fragment thereof, as defined in any one of the claims 12 to 15, to 
said animal, 

• removing the spleen or spleen cells from said animal, 

• producing hybridoma cells of said spleen or spleen cells, 

• selecting and cloning hybridoma cells specific for said hyperimmune serum-reactive antigens or 
a fragment thereof, 

• producing the antibody by cultivation of said cloned hybridoma cells and optionally further 
purification steps. 

28. Use of the antibodies according to any one of claims 20 to 24 for the preparation of a medicament 
for treating or preventing S. pneumoniae infections. 

29. An antagonist, which binds to the hyperimmune serum-reactive antigen or a fragment thereof 
according to any one of claims 11 to 14. 

30. A method for identifying an antagonist capable of binding to the hyperimmune serum-reactive 
antigen or fragment thereof according to any one of claims 11 to 14 comprising: 

a) contacting an isolated or immobilized hyperimmune serum-reactive antigen or a fragment 
thereof according to any one of claims 11 to 14 with a candidate antagonist under conditions to 
permit binding of said candidate antagonist to said hyperimmune serum-reactive antigen or 
fragment, in the presence of a component capable of providing a detectable signal in response to 
the binding of the candidate antagonist to said hyperimmune serum reactive antigen or fragment 
thereof; and 

b) detecting the presence or absence of a signal generated in response to the binding of the 
antagonist to the hyperimmune serum reactive antigen or the fragment thereof. 

31. A method for identifying an antagonist capable of reducing or inhibiting the interaction activity of 
a hyperimmune serum-reactive antigen or a fragment thereof according to any one of claims 11 to 
14 to its interaction partner comprising: 

a) providing a hyperimmune serum reactive antigen or a hyperimmune 
fragment thereof according to any one of claims 11-14, 

b) providing an interaction partner to said hyperimmune serum reactive antigen or a fragment 
thereof, especially an antibody according to any one of the claims 20 to 24, 

c) allowing interaction of said hyperimmune serum reactive antigen or fragment thereof to said 
interaction partner to form an interaction complex, 

d) providing a candidate antagonist, 

e) allowing a competition reaction to occur between the candidate antagonist and the interaction 
complex, 

f) determining whether the candidate antagonist inhibits or reduces the interaction activities of the 
hyperimmune serum reactive antigen or the fragment thereof with the interaction partner. 

32. Use of any of the hyperimmune serum reactive antigen or fragment thereof according to any one of 
claims 11 to 14 for the isolation and/or purification and/or identification of an interaction partner of 
said hyperimmune serum reactive antigen or fragment thereof. 

33. A process for in vitro diagnosing a disease related to expression of the hyperimmune serum- 
reactive antigen or a fragment thereof according to any one of claims 11 to 14 comprising 
determining the presence of a nucleic acid sequence encoding said hyperimmune serum reactive 
antigen and fragment according to any one of claims 1 to 7 or the presence of the hyperimmune 
serum reactive antigen or fragment thereof according to any one of claims 11-14. 

34. A process for in vitro diagnosis of a bacterial infection, especially a S. pneumoniae infection, 
comprising analysing for the presence of a nucleic acid sequence encoding said hyperimmune 
serum reactive antigen and fragment according to any one of claims 1 to 7 or the presence of the 
hyperimmune serum reactive antigen or fragment thereof according to any one of claims 11 to 14. 

35. Use of the hyperimmune serum reactive antigen or fragment thereof according to any one of 
claims 11 to 14 for the generation of a peptide binding to said hyperimmune serum reactive 
antigen or fragment thereof, wherein the peptide is selected from the group comprising anticalines. 

36. Use of the hyperimmune serum-reactive antigen or fragment thereof according to any one of 
claims 11 to 14 for the manufacture of a functional nucleic acid, wherein the functional nucleic acid 
is selected from the group comprising aptamers and spiegelmers. 

37. Use of a nucleic acid molecule according to any one of claims 11 to 14 for the manufacture of a 
functional ribonucleic acid, wherein the functional ribonucleic acid is selected from the group 
comprising ribozymes, antisense nucleic acids and siRNA. 

SEQUENCE LISTING 



SeqID 1 

atgtctaaaa atattgtaca attgaataat tcttttattc aaaatgaata ccaacgtcgt 60 

cgctacctga tgaaagaacg acaaaaacgg aatcgtttta tgggaggggt attgattttg 120 

attatgctat tatttatctt gccaactttt aatttagcgc agagttatca gcaattactc 180 

caaagacgtc agcaattagc agacttgcaa actcagtatc aaactttgag tgatgaaaag 240 

gataaggaga cagcatttgc taccaagttg aaagatgaag attatgctgc taaatataca 300 

cgagcgaagt actattattc taagtcgagg gaaaaagttt atacgattcc tgacttgctt 360 

caaagg 366 

SeqID 2 

atggataaga aaaaattatt attgattgat gggtcttctg tagcttttcg ggcgtttttt 60 

gcgctgtatc agcagttgga ccgttttaag aatgtggctg gtttgcatac caatgcgatt 120 

tatggttttc agttgatgtt gagtcattta ttggagcggg ttgagccgag tcatattttg 180 

gtggcttttg atgcgggaaa gacgaccttc cggacagaga tgtatgcgga ctataagggt 240 

ggtcgggcca agactcctga tgagtttcgt gagcaatttc ctttcattcg tgagttgctg 300 

gatcatatgg ggattcgtca ctatgatctg gctcagtatg aggcggatga catcattggg 360 

acgctggata agctagcaga gcaggatggt tttgatatta ctattgtcag tggggacaag 420 

gatttgattc agctgacgga tgagcatacg gtggttgaaa tttccaagaa aggtgtggct 480 

gagtttgagg cctttacgcc agattacctc atggaagaaa tgggcctcac accagctcag 540 

tttatcgatc tcaaggcgct catgggtgat aagtcggata atatccctgg ggtgaccaaa 600 

gtcggtgaaa agacgggtat taagctcttg ctggagcatg gttcgcttga ggggatttat 660 

gaaaatattg atggaatgaa gacttctaag atgaaggaaa atctcatcaa tgacaaggaa 720 

caggcctttt tgtctaaaac actagcgacc attgatacca aggcaccgat tgcgattggt 780 

ttagaggact tggtctatag tggtccagat gttgaaaatc ttgggaaatt ctacgatgag 840 

atgggcttca aacagctaaa gcaggcttta aatgtgtcgt cagctgatgt gtctgagagt 900 

ttggatttta ctattgttga ccaaatcagt caagatatgc tgagtgaaga gtctatcttc 960 

cactttgagc tttttggtga gaattaccat acggataatt tggttggatt tgtctggtct 1020 

tgtggggata agctctatgc cacagacaag cttgagctgt tgcaagaccc gattttcaag 1080 

gatttcttag aaaaaacatc tctgagagtt tatgacttta agaaggttaa agttcttttg 1140 

caacgttttg gtgtggattt gcaggcgcct gcttttgaca tccgtttggc taaatacctc 1200 

ctttcgactg tggaggacaa tgaaattgcg accatcgcta gtctttatgg tcagacttac 1260 

ttggttgatg atgaaacttt ctacggtaag ggtgttaaaa aggccattcc tgaacgtgag 1320 

aaattcttgg aacacttagc ttgtaaactt gctgttttgg tagaaacaga gcctatttta 1380 

cttgaaaaac tcagcgaaaa tgggcaatta gagcttcttt atgatatgga gcaacctctg 1440 

gcttttgtcc ttgccaagat ggaaattgct gggattatgg tcaagaaaga gaccttgctt 1500 

gagatgcagg ctgaaaatga gcttgtcatt gaaaaactga ctcaagagat ttacgagctg 1560 

gctggtgagg agtttaatgt caactcgcct aagcagttgg gcgtgcttct ctttgagaaa 1620 

ttgggacttc ctctagaata cactaagaaa accaagacag gttattcgac agcagtggat 1680 

gttttagagc gtctcgctcc tattgctccg attgttaaga aaatcctgga ttaccgtcaa 1740 

attgctaaga ttcaatctac ttatgtaatt ggcttgcagg actggatttt ggctgatgga 1800 

aagattcata ctcgctatgt gcaggatttg acccagaccg ggcgtttgtc tagtgtggat 1860 

ccaaacttgc aaaatattcc tgcccgattg gaacaggggc gcttgattcg gaaggctttt 1920 

gtgccagagt gggaggatag tgtgctactc agctctgact attcacagat tgaattgcgc 1980 

gttttggcgc atatttctaa ggatgagcac ttgattaagg ccttccaaga gggggcagat 2040 

atccatactt cgacagccat gcgggtcttt ggcattgagc gtcctgatga tgtgactgca 2100 

aacgaccgtc gcaatgccaa ggcagttaac tttggagtgg tttatgggat ttcagacttt 2160 

ggcttgtcta ataatttggg aattagtcgt aaggaagcca aagcctacat tgatacctac 2220 

tttgaacgtt ttccaggtat taaaaactac atggatgaag tggtgcggga ggcgcgtgat 2280 

aagggctatg tagagaccct ctttaagcgt cgccgtgagt tgccagatat caattcgcgc 2340 

aacttcaata ttcgtggttt tgcggagcga actgctatca actcacctat ccagggttcg 2400 

gcagcagata ttctcaagat tgccatgatt cagctggata aagccttggt tgcaggtggt 2460 

tatcagacta agatgctgtt acaagtgcac gatgaaatcg tccttgaagt gcctaaatct 2520 

gaattggtag agatgaaaaa attggtgaaa caaaccatgg aagaagccat tcaactcagt 2580 

gttcctctta tcgcagatga gaatgaaggg gcaacgtggt acgaggctaa 2631 

SeqID 3 

atggggatgg cagcttttaa aaatcctaac aatcaataca aagctattac aattgctcaa 60 

actctaggtg atgatgcttc ttcagaggaa ttggctggta gatatggttc tgctgttcag 120 

tgtacagaag tgactgcctc aaacctttca acagttaaaa ctaaagctac ggttgtagaa 180 

aaaccactga aagattttag agcgtctacg tctgatcagt ctggttgggt ggaatctaat 240 

ggtaaatggt atttctatga gtctggtgat gtgaagacag gttgggtgaa aacagatggt 300 

aaatggtact atttgaatga cttaggtgtc atgcagactg gatttgtaaa attttctggt 360 

agctggtatt acttgagcaa ttcaggtgct atgtttacag gctggggaac agatggtagc 420 

agatggttct actttgacgg ctcaggagct atgaagacag gctggtacaa ggaaaatggc 480 

acttggtatt accttgacga agcaggtatc atgaagacag gttggtttaa agtcggacca 540 

cactggtact atgcctacgg ttcaggagct ttggctgtga gcacaacaac accagatggt 600 

taccgtgtaa atggtaatgg tgaatgggta aac 633 

SeqID 4 

atgagccgaa aaagcattgg tgagaaacgc catagtttct cgatgagasia gttgtcagtg 60 

ggattggtat cagttactgt atctagtttc tttttgatga gtcaagggat tcaatcggta 120 

tcggccgata atatggaaag tccaattcat tataagtata tgaccgaggg taaattgaca 180 

gacgaggaaa aatccttgct ggtagaggcc cttccacaac tggctgeiaga atcagatgat 240 

acttattact tggtttatag atctcaacag tttttaccga atacaggttt taacccaact 300 

gttggtactt tcctttttac tgcaggattg agcttgttag ttttattggt ttctaaaagg 360 

gaaaatggaa agaaacgact tgttcatttt ctgctgttga ctagcatggg agttceiattg 420 

ttgccggcca gtgcttttgg gttgaccagc cagattttat ctgcctataa tagtcagctt 480 

tctatcggag tcggggaaca tittaccagag cctctgaeiaa tcgaaggtta tcaatatatt 540 

ggttatatca aaactaagaa acaggataat acagagcttt caaggacagt tgatgggaaa 600 

tactctgctc aaagagatag tcaaccaaac tctacaeiaaa catcagatgt agttcattca 660 

gctgatttag aatggaacca aggacagggg aaggttagtt tacaaggtga agcatcaggg 720 

gatgatggac tttcagaaaa atcttctata gcagcagaca atctatcttc taatgattca 780 

ttcgcaagtc aagttgagca gaatccggat cacaaaggag aatctgtagt tcgaccaaca 840 

gtgccagaac aaggaaatcc tgtgtctgct acaacggtgc agagtgcgga agaggaagta 900 

ttggcgacga caaatgatcg accagagtat aaacttccat tggaeiaccaa aggcacgcaa 960 

gaacccggtc atgagggtga agccgcagtc cgtgaagact taccagtcta cactaagcca 1020 

ctagaaacca aaggtacaca aggacccgga catgaaggtg aagctgcagt tcgcgaggaa 1080 

gaaccagctt acacagaacc gttagcaacg aaaggcacgc aagagccagg tcatgagggc 1140 

aaagctacag tccgcgaaga gactctagag tacacggaac cggtagcgac aaaaggcaca 1200 

caagaacccg aacatgaggg cgaagcggca gtagaagaag aacttccggc tttagaggtc 1260 

actacacgaa atagaacgga aatccagaat attccttata caacagaaga aattcaggat 1320 

ccaacacttc tgaaaaatcg tcgtaagatt gaacgacaag ggcaagcagg gacacgtaca 1380 

attcaatatg aagactacat cgtaaatggt aatgtcgtag aaactaaaga agtgtcacga 1440 

actgaagtag ctccggtcaa cgaagtcgtt aaagtaggaa cacttgtgaa agttaaacct 1500 

acagtagaaa ttacaaactt aacaaaagtt gagaacaaaa aatctataac tgtaagttat 1560 

aacttaatag acactacctc agcatatgtt tctgcaaaaa cgcaagtttt ccatggagac 1620 

aagctagttia aagaggtgga tatagaaaat cctgccasiag agcaagtaat atcaggttta 1680 

gattactaca caccgtatac agbtaaaaca cacctaactt ataatttggg tgaaaataat 1740 

gaggaaaata ctgaaacatc aactcaagat ttccaattag agtataagaa aatagagatt 1800 

aaagatattg attcagtaga attatacggt aaagaaaatg atcgttatcg tagatattta 1860 

agtctaagtg aagcgccgac tgatacggct aeiatactttg taaaagtgaa atcagatcgc 1920 

ttcaaagaaa tigtacctacc tgtaaaatct attacagasia atacggatgg aacgtataaa 1980 

gtigacggtag ccgttgatca acttgtcgaa gaaggtacag acggttacaa agatgattac 2040 

acatttactg tiagctaaatc taaagcagag caaccaggag tttacacatc ctttaaacag 2100 

cbggtaacag ccatgcaaag caatcbgtict ggtgtcbata cattggcttc agatabgacc 2160 

gcagatgagg tgagcttagg cgataagcag acaagttatc tcacaggtgc atttacaggg 2220 

agcbtgatcg gbbcbgatgg aacaaaatcg tatgccattt atgatttgaa gaaaccatta 2280 

tttgatacat taaatggtgc tacagttaga gatttggata ttaaaactgt ttctgctgat 2340 

agtiaaagaaa atgtcgcagc gctggcgaag gcagcgaata gcgcgaatat taataatgtt 2400 

gcagtagaag gaaaaatctc aggtgcgaaa tctgttgcgg gattagtagc gagcgcaaca 2460 

aatacagtga tagaaaacag ctcgtttaca gggaaactta tcgcaaatca ccaggacagt 2520 

aataaaaatg atactggagg aatagtaggt aatataacag gaaatagttc gagagttaat 2580 

eiaagttaggg tagatgcctt aatctctact aatgcacgca ataataacca aacagctgga 2640 

gggatagtag gtagattaga aaatggtgca ttgatatcta attcggttgc tactggagaa 2700 

atacgaaatg gtcsiaggata ttctagagtc ggaggaatag taggatctac gtggcaaaac 2760 

ggbcgagtaa ataatgttgt gagtsiacgta gatgttggag atggttatgt tatcaccggt 2820 

gatcaatacg cagcagcaga tgtgaaaaat gcaagtacat cagttgataa tagaaaagca 2880 

gacagattcg ctacaaaatt atcaaaagac caaatagacg cgaaagttgc tgattatgga 2940 

atcacagtaa ctcttgatga tactgggcaa gatttaaaac gtaatctaag agaagttgat 3000 

tabacaagac tiaaataaagc agaagctgaa agaaaagtag cttatagcaa catagaaciaa 3060 

ctgatgccat: tctacaataa agaccbagta gbtcactatg gtaacaaagt agcgacaaca 3120 

gataaacttt acactacaga attgttagat gttgtgccga tgaaagatga tgaagtagta 3180 

acggatatta ataataagaa aaattceiata aataaagtta tgttacattt caaagataat 3240 

acagtagaat acctagatgt aacattcaaa gaaeiacttca taaacagtca agtaatcgaa 3300 

tacaatgtta caggaaaaga atatatattc acaccagaag catttgtttc agactataca 3360 

gcgabaacga ataacgtact aagcgacttg caaaatgtaa cacttaactc agaagctact 3420 

aaaaaagtac taggagcagc gaatgatgca gccttagata acctatactt agatagacaa 3480 

tttgaagaag ttaaagctaa tatagcagaa cacctaagaa aagtattagc gatggataaa 3540 

tcaatcaata ctacaggaga cggtgtagtt gaatacgtaa gtgagaaaat caaaaataac 3600 

aaagaagcat ttatgctagg tcttacttat atgaaccgtt ggtacgatat taattatggt 3660 

aaaatgaata caaaagattt atctacgtac aagtttgact ttaacggaaa taatgagact 3720 

tcaacgttgg atactattgt cgcattagga aatagtggac tagataacct gagagcttca 3780 

aabactgtag gtttatatgc gaataaactt gcatcggbaa aaggagaaga ttcagtcttt 3840 

gacttcgtag aagcgtatag aeiaactgttc btaccaaaca aaacaaabaa cgagbggttb 3900 

asiagaaaata caaaggcata tatagtcgaa atgaagtctg atattgcaga agtacgagaa 3960 

aaacaagaat caccaacagc cgatagaaaa tattcattag gagtttacga tagaatatca 4020 

gcaccaagtt gggggcataa gagtatgtta ttaccactac taactttacc tgaagaatct 4080 

gtgtatattt catcgaatat gtctacactt gcattcggtt cgtatgaaag atatcgtgat 4140 

agtgtggatg gagttattct ttcaggagat gctttacgaa cttatgtaag aaatagagtt 4200 

gatatagcag cgaaaaggca tagagaccat tatgatattt ggtacaatct tcttgacagt 4260 

gcttcaaaag aaaaactttt ccgttctgtg atagtttatg atggattcaa tgtaaaagat 4320 

gagacaggaa gaacttattg ggcaaggtta acggataaaa acatcggctc tattaaagaa 4380 

ttcttcggac ctgttgggaa atggtatgag tataatagta gtgcaggagc gtatgcgaat 4440 

ggaagtttaa cgcactttgt gttagataga ttattagatg cttatggaac gtcggtttat 4500 

actcatgaaa tggttcataa ttctgattct gcaatctact ttgaaggaaa tggtagacgt 4560 

gaaggattgg gagcggagtt atacgcactt ggtttactgc aatctgtaga tagtgtaaat 4620 

tctcatattt tagctttaaa tacgttatat aaagcagaaa aagatgattt gaatagattg 4680 

catacatata atccggtgga acgtttcgat tcggatgagg cgcttcaaag ttatatgcat 4740 

ggatcatatg atgtaatgta tacacttgat gcgatggaag caaaagcgat attagctcaa 4800 

aataatgatg ttaagaaaaa atggtttaga aaaatagaaa attattacgt tcgtgatact 4860 

agacataata aagatacaca tgcaggaaat aaagtccgtc cattaacaga tgaagaagta 4920 

gctaacttaa catcgttaaa ctcattaatc gacaacgaca tcataaatag acgtagctat 4980 

gatgatagta gagaatataa acgaaatggc tactatacta taagtatgtt ctctcctgta 5040 

tacgcagcgc taagcaattc gaaaggtgct cctggagata ttatgtttag aaaaatagct 5100 

tatgaattac ttgcggaaaa aggttatcac aaaggattcc taccttatgt ttctaatcag 5160 

tacggagcag aagcatttgc cagcggaagc aiaaacattct catcatggca tggaagagat 5220 

gttgctttag tgacagatga tttagtattt aagaaagtat tcaatggtga gtactcatca 5280 

tgggctgatt tcaaaaaagc aatgtttaaa caacgtatag ataaacaaga taatctgaaa 5340 

ccaataacaa ttcaatacga attaggtaat cctaatagta caaaagaagt aactataaca 5400 

acggctgcac aaatgcciaca attaattaat gaagcggctg cgaaagatat tactaatata 5460 

gatcgtgcaa cgagticatac cccagcaagt tgggtgcatt tattaaaaca ciaaaatctat 5520 

aatgcatatc ttcgcactac agatgacttt agaaattcta tatataaa 5568 

SeqID 5 

atgaaattca atccaaatca aagatatact cgttggtcta ttcgccgtct cagtgtcggt 60 

gttgcctcag ttgttgtggc tagtggcttc tttgtcctag ttggtcagcc aagttctgta 120 

cgtgccgatg ggctcaatcc aaccccaggt caagtcttac ctgaagagac atcgggaacg 180 

aaagagggtg acttatcaga aaaaccagga gacaccgttc tcactcaagc gaaacctgag 240 

ggcgttactg gaaatacgaa ttcacttccg acacctacag aaagaactga agtgagcgag 300 

gaaacaagcc cttctagtct ggatacactt tttgaaaaag atgaagaagc tcaaaaaaat 360 

ccagagctaa cagatgtctt aaaagaaact gtagatacag ctgatgtgga tgggacacaa 420 

gcaagtccag cagaaactac tcctgaacaa gtaaaaggtg gagtgaaaga aaatacaaaa 480 

gacagcatcg atgttcctgc tgcttatctt gaaaaagctg aagggaaagg tcctttcact 540 

gccggtgtaa accaagtaat tccttatgaa ctattcgctg gtgatggtat gttaactcgt 600 

ctattactaa aagcttcgga taatgctcct tggtctgaca atggtactgc taaaaatcct 660 

gctttacctc ctcttgaagg attaacaaaa gggaaatact tctatgaagt agacttaaat 720 

ggceiatactg ttggtaaaca aggtcaagct ttaattgatc aacttcgcgc taatggtact 780 

caaacttata aagctactgt taaagtttac ggaaataaag acggtaaagc tgacttgact 840 

aatctagttg ctactaaaaa tgtagacatc aacatcaatg gattagttgc taaagaaaca 900 

gttcaaaaag ccgttgcaga caacgttaaa gacagtatcg atgttccagc agcctaccta 960 

gaaaaagcca agggtgaagg tccattcaca gcaggtgtca accatgtgat tccatacgaa 1020 

ctcttcgcag gtgatggcat gttgactcgt ctcttgctca aggcatctga caaggcacca 1080 

tggtcagata acggcgacgc taaaaaccca gccctatctc cactaggcga aaacgtgaag 1140 

accaaaggtc aatacttcta tcaagtagcc ttggacggaa atgtagctgg caaagaaaaa 1200 

caagcgctca ttgaccagtt ccgagcaaat ggtactcaaa cttacagcgc tacagtcaat 1260 

gtctatggta acaaagacgg taaaccagac ttggacaaca tcgtagcaac taaaaaagtc 1320 

actattaaca taaacggttt aatttctaaa gaaacagttc aaaaagccgt tgcagacaac 1380 

gtteiaagaca gtatcgatgt tccagcagcc tacctagaaa aagccaaggg tgaaggtcca 1440 

ttcacagcag gtgtcaacca tgtgattcca tacgaactct tcgcaggtga tggtatgttg 1500 

actcgtctct tgctcaaggc atctgacaag gcaccatggt cagataacgg tgacgctaaa 1560 

aacccagccc tatctccact aggtgaaaac gtgaagacca aaggtcaata cttctatcaa 1620 

ttagccttgg acggaaatgt agctggcaaa gaaaaacaag cgctcattga ccagttccga 1680 

gcaaacggta ctcaaactta cagcgctaca gtcaatgtct atggtaacaa agacggtaaa 1740 

ccagacttgg acaacatcgt agcaactaaa aaagtcacta ttaacataaa cggtttaatt 1800 

tctaaagaaa cagttcaaaa agccgttgca gacaacgtta aggacagtat cgatgttcca 1860 

gcagcctacc tagaaaaggc caagggtgaa ggtccattca cagcaggtgt caaccatgtg 1920 

attccatacg aactcttcgc aggtgatggc atgttgactc gtctcttgct caaggcatct 1980 

gacaaggcac catggtcaga taacggcgac gctaaaaacc cagctctatc tccactaggt 2040 

gaaaacgtga agaccaaagg tcaatacttc tatcaagtag ccttggacgg aaatgiiagct 2100 

ggcaaagaaa aacaagcgct cattgaccag ttccgagcaa acggtactca aacttacagc 2160 

gctacagtca atgtctiatgg taacaaagac ggtaaaccag acttggacaa catcgtagca 2220 

actaaaaaag tcactattaa gataaatgtt aaagaaacat cagacacagc aaatggttca 2280 

ttatcacctt ctaactctgg ttctggcgtg actccgatga atcacaatca tgctacaggt 2340 

actacagata gcatgcctgc tgacaccatg acaagttcta ccaacacgat ggcaggtgaa 2400 

aacatggctg cttctgctaa caagatgtct gatacgatga tgtcagagga taaagctatg 2460 

ctaccaaata ctggtgagac tcaaacatca atggcaagta ttggtttcct tgggcttgcg 2520 

cttgcaggtt tactcggtgg tctaggtttg aaaaacaaaa aagaagaaaa c 2571 



SeqID 6 

atgaaatcaa taactaaaaa gattaaagca actcttgcag gagtagctgc cttgtttgca 60 

gtatttgctc catcatttgt atctgctcaa gouitcatcaa cttacactgt taaagaaggt 120 

gatacacttt cagaaatcgc tgaaactcac aacacaacag ttgaaaaatt ggcagaaaac 180 

aaccacattg ataiacattca tttgatttat gttgatcaag agttggttat cgatggccct 240 

gtagcgcctg ttgcsiacacc agcgccagct acttatgcgg caccagccgc tcaagatgaa 300 

actgtttcag ctccagtagc agaaactcca gtagtaagtg aaacagttgt ttcaactgta 360 

agcggatctg aagcagaagc caaagaatgg atcgctcaaa aagaatcagg tggtagctat 420 

acagctacaa atggacgtta tatcggacgt taccaattaa cagattcata cctgaacggt 480 

gactactcag ctgaaaacca agaacgtgta gcagatgcct acgttgcagg acgttacggt 540 

tcatggactg ctgctaaaaa cttctggctt ucaatggct ggtat 585 



SeqID 7 

atgaataaga aaaaaatgat tttaacaagt ctagccagcg tcgctatctt aggggctggt 60

tttgttacgt ctcagcctac ttttgtciaga gcagaagaat ctccacaagt tgtcgaaaaa 120

fccttcattag agaagaaata tgaggaagca aaagcaaaag ctgatactgc Cciagaaagat 180

tacgaaacgg ctaaaaagaa agcagaagac gctcagaaaa agtatgaaga tgatcagaag 240

agaactgagg agaaagctcg aaaagaagca gaagcatctc aaaaattgaa tgatgtggcg 300

cttgttgttc aaaatgcata taaagagtac cgagaagttc aaaatcaacg tagtaaatat 360

aaatctgacg ctgaatatca gaaaaaatta acagaggtcg actctaaaat agagaaggct 420

aggaaagagc aacaggactt gcaaaataaa tttaatgaag taagagcagt tgtagttcct 480

gaaccaaatg cgttggctga gactaagaaa aaagcagaag aagctaaagc agaagaaaeia 540

gtagctaaga gaaaatatga ttatgcaact ctaaaggtag cactagcgaa gaaagaagta 600

gaggctaagg aacttgaaat tgaaaaactt caatatgaaa tttctacttt ggaacaagaa 660

gttgctactg ctcaacatca agtagataat ttgaeiaaaac ttcttgctgg tgcggatcct 720

gatgatggca cagaagttat agaagctaaa ttaaaaaaag gagaagctga gctaaacgct 780

aaacaagctg agttagcaaa aaaacaaaca gaacttgaaa aacttcttga cagccttgat 840

cctgaaggta agactcagga tgaattagat aaagaagcag aagaagctga gttggataaia 900

aaagctgatg aacttcaaaa taaagttgct gatttagaaa aagaaattag taaccttgaa 960

atattacttg gaggggctga tcctgaagat gatactgctg ctcttcaaaa taaattagct 1020

gctaaaaaag ctgagttagc aaaaaaacaa acagaacttg aaaaacttct tgacagcctt 1080

gatcctgaag gtaagactca ggatgaatta gataaagaag cagaagaagc tgagttggat 1140

aaaaaagctg atgaacttca aaataaagtt gctgatttag aaaaagaaat tagtaacctt 1200

gaaatattac ttggaggggc tgattctgaa gatgatactg ctgctcttca aaataaatta 1260

gctactaaaa aagctgaatt ggaaaaaact caaaaagaat tagatgcagc tcttaatgag 1320

ttaggccctg atggagatga agaagaaact ccagcgccgg ctcctcaacc agagcaacca 1380

gctcctgcac caaaaccaga gcaaccagct ccagctccaa aaccagagca accagctcct 1440

gcaccaaaac cagagcaacc agctccagct ccaaaaccag agcaaccagc tccagctcca 1500

aaaccagagc aaccagctaa gccggagaaa ccagctgaag agcctactca accagaaaaa 1560

ccagccactc caaaaacagg ctggaaacaa gaaaacggta tgtggtattt ctacaatact 1620

gatggttcaa tggcaatagg ttggctccaa aacaacggtt catggtacta cctaaacgct 1680

aacggcgcta tggcaacagg ttgggtgaaa gatggagata cctggtacta tcttgaagca 1740

tcaggtgcta tgaaagcaag Gcaatggttc aaagtatcag ataaatggta ctatgtcaac 1800

agcaatggcg ctatggcgac aggctggctc caatacaatg gctcatggta Gtacctcaac 1860

gctaatggtg atatggcgac aggatggctc caatacaacg gttcatggta ttacctcaac 1920

gctaatggtg atatggcgac aggatgggct aaagtcaacg gttcatggta ctacctaaac 1980

gctaacggtg ctatggctac aggttgggct aaagtcaacg gttcatggta ctacctaaac 2040

gctiaacggtt caatggcaac aggttgggtg aaagatggag atacctggta ctatcttgaa 2100

gcatcaggtg ctatgaaagc aagccaatgg ttcaaagtat cagataaatg gtactatgtc 2160

aatggcttag gtgcccttgc agtcaacaca actgtagatg gctataaagt caatgccaat 2220

ggtgaatggg tt 2232



SeqID 8

atgaaaaaaa tagttcttgt tagtctagct ttcctttttg tcctggttgg ttgcggacag 60 

aaaaaagaaa ctggaccagc tacaaaaaca gaaaaagata cgcttcagtc ggcattgcca 120 

gttattgaaa atgctgagaa gaatacagtt gtaactaaga ctttggtctt gcccaagtca 180 

gatgatggta gccagcagac acaaacaatt acttacaaag acaagacttt tttgagtcta 240 

gctatccaac siaaaacgtcc agtctctgat gagttgaaga cttatattga ccaacatgga 300 

gtggaggaaa ctcaaaaagc tcttcttgaa gcggaggaga aggataagtc tatcattgaa 360 

gctcgtaaat tggcaggttt caaacttgaa acaaaactat tgagcgcaac ggaacttcaa 420 

acaacgacta gttttgattt tcaagttctg gatgtcaaga aggcttccca gttggaacat 480 

ctgaagaata ttggtttgga aaatcttttg aaaaatgaac caagcaaata tatttcagat 540 
agattggcaa atggcgcgac agsLacaa 567
SeqID 9

atgtttgaag tagaagaatg gctccatagt cggattggtt tgaattttcg atcaggtttg 60 

ggtcgaatgc agcaagcggt ggatttgtta ggaaatcctg agcagtctta ccctattatc 120 

cacgtaacag ggactaatgg gaaaggatct accattgctt ttatgaggga attatttatg 180 

gggcatggca aeiaaagttgc gacctttacc tcccctcata tcgtctctat caatgaccga 240 

atctgcatta atgggcagcc tatagcagac gcagacttta tccgtttgac tgatcaggtc 300 

aaggagatgg agaaaacgct tctgcsiaact cctgcccagt tgtccttttt tgaabtgctg 360 

accttggttg cttttcttta ttttagggag caggaggtgg atttggtttt attagaagtg 420 

ggaattggtg gcttacttga cacgaccaat gtggtaactg gagagtttgc tgtcatcacc 480 

tccattgggc ttgaccatca agaaaccttg ggtgatagtc tagaagcaat tgcagagcag 540 

aaagctggta ttttcaaggc tggtaagaag gcagtgattg cgaaattgcc tccagaagct 600 

aggcttgcct gtcagaaaaa agccgaatct ttagctgtta acctttatca ggcaggtcaa 660 

gattttttaa tgctgaatgg tgatttttca agctctttac taaatctttc acagctgaac 720 

ataggcttag aaggagtcta tcagcaggag aatgcagcct tggcgttgca aacttttctt 780 

ctttttatga gagaaagaaa ggaagctgtt gatgaacagg ctgtaagaaa ggccttggaa 840 

cagacccatt gggctggtcg cttggagcgt attcgcccac agatttattt ggatggtgct 900 

cataacctcc ctgccttgac tcgcttggct gagtttatca aagaaaaaga gcaggaaggc 960 

tatcgacctc aaatcctctt tggatccttg aaacgtaagg attatcaagg gatgttgggt 1020 

tatctgactg aaaaattgcc tcaggtggaa ctcaaggtga ccggctttga ctatcagggg 1080 

gctttggacg aaagggatgt gacaggttac gatatagttt cttcttaccg agaatttatc 1140 

agcgattttg ugeiaagggc agacgctcaa gacttgctgt tcgttacagg gtctctctat 1200 

tttatctcag aagtacgggg ctacctgctg gaccgtgagc agataaat 1248 

SeqID 10 

gtgggaattc gtgtttataa accaacaaca aacggtcgcc gtaatatgac ttctttggat 60 

ttcgctgaaa tcacaacaag cactcctgaa siaatcattgc ttgttgcatt gaagagcaag 120 

gctggtcgta acaacaacgg tcgtatcaca gttcgtcacc aaggtggtgg acacaaacgt 180 

ttctaccgtt tggttgactt caaacgtaat 8uiagacaacg ttgaagcagt tgttaaaaca 240 

atcgagtacg atccsuaaccg ttctgcaaac atcgctcttg tacactacac tgacggtgtg 300 

aaagcataca tcatcgctcc aaaaggtctt gaagtaggtc aacgtatcgt ttcaggtcca 360 

gaagcagata tcaaagtcgg aaacgctctt ccacttgcta acatcccagt tggtactttg 420 

attcacaaca tcgagttgaa accaggtcgt ggtggtgaat tggtacgtgc tgctggtgca 480 

tctgctcaag tattgggttc tgaaggtaaa tatgttcttg ttcgtcttca atcaggtgaa 540 

gttcgtatga ttcttggaac ttgccgtgct acagttggtg ttgtcggaaa cgaacaacat 600 

ggacttgtaa accttggtaa agcaggacgt agccgttgga aaggtatccg cccaacagtt 660 

cgtggttctg taatgaaccc taacgatcac ccacacggtg gtggtgaagg taaagcacca 720 

gttggtcgta aagcaccatc tactccatgg ggcaaacctg ctcttggtct taaaactcgt 780 

aacaagaaag cgaaatctga caaacttatc gttcgtcgtc gcaacgagaa a 831 

SeqID 11

atggctaaaa aatcaatggt agctagagag gctaaacgcc aaiaaaattgt tgaccgttat 60 

gctgaaaaac gtgctgcatt aaaggcggca ggggactacg aaggtttatc taaattacct 120 

cgcaacgcct caccgactcg tttacataat cgttgtaggg ttacggggcg cccacattca 180 

gttbaccgca aatttggtct gagtcgtatc gcttttcgcg aacttgcgca taaaggtcaa 240

attcctggtg taacaaaagc atcttgg 267 

SeqID 12

atggatatta gacaagttiac tgaaaccatc gccatgattg aggagcaaaa cttcgatatt 60 

agaaccatta ccatggggat ttctcttttg gactgtatcg atccagatat caatcgtgct 120 

gcggagaaaa tctatcaaaa aattacgaca aaggcggcta atttagtagc tgttggtgat 180 

gaaattgcgg ctgagttggg aabtcctatc gttaataagc gtgtatcggt gacacctatt 240 

tctctgattg gggcagcgac agatgcgacg gactacgtgg ttctggcaaa agcgcttgat 300 

aaggctgcga aagagattgg tgtggacttt attggtggtt tttctgcctt agtacaaaaa 360 

ggbtatcaaa agggagatga gattctcatc aattccattc ctcgcgcttt ggctgagacg 420 

gataaggtcb gctcgtcagt caatatcggc tcaaccaagt ctggtattaa tatgacggct 480 

gtggcagata tgggacgaat tatcaaggaa acagcaaatc tttcagatat gggagtggcc 540 

aagttggttg tattcgctaa tgctgttgag gacaatccat ttatggcggg tgcctttcat 600 

ggtgttgggg aagcagatgt tatcatcaat gtcggagttt ctggtcctgg tgttgtgaaa 660 

cgtgctttgg aasiaagttcg tggacagagc tttgatgtag tagccgaaac agttaagaaa 720 

actgccttta aaatcactcg tatcggbcaa ttggttggtc aaatggccag tgagagactg 780 

ggtfftggagt ttggtattgt ggacbtgagt ttggcaccaa cccctgcggt tggagactct 840 

gtggcacgtg tccttgagga aatggggcta gaaacagttg gcacgcatgg aacgacggct 900 

gccttggccc tcttgaacga ccaagttaaa aagggtggag tgatggcctg caaccaagtc 960 

ggtggtttat ctggtgcctt tatccctgtt tctgaggatg aaggaatgat tgctgcagtg 1020 

caaaatggct ctcttaattt agaaaaacta gaagctatga cggctatctg ttctgttgga 1080 

ttggatatga ttgccatccc agaagatacg cctgctgacia ctattgcggc tatgattgcg 1140 
gatgaagcag caatcggtgt tatcaacatg aaaacaacag ctgttcgtat cattcccaaa 1200

ggaeiaagaag gcgatatgat tgagtttggt ggtctattag gaactgcacc cgttatgaag 1260 

gttaatgggg cttcgtctgt cgacttcatc tctcgcggtg gacasLatccc agcaccaatt 1320 

catagtttta aaaat 1335 

SeqID 13 

atggtaaata cagaagtagc aagaacaaca atcaagacag aatattttgg cagccttact 60 

gaaaggatga acaaatatcg agaagatgtt ttaaataaaa aaccttatat tgatgctgag 120 

agagcagttc tagcaacacg cgcctatgaa cgatacaagg aacaacctaa tgtcctaaaa 180 

cgtgcatata tgctgaaaga aattttggaa aatatgacta tctatattga agaagaatct 240 

atgattgcgg gaaatcaagc ttcttccaat aaagatgctc ctatttttcc ggaatatacg 300 

Gtagaatttg ttctcaatga gttggatctt tttgaaaagc gtgatggaga tgttttctat 360 

attacagaag aaacaaaaga acaacttaga agtattgctc cgttttggga aaataataat 420 

ttacgtgcta gagctggtgc cttattacct gaagaagtgt ctgtttatat ggaaacagga 480 

ttcttcggta tggaaggtaa gatgaattct ggagatgctc acttagcagt taactatcag 540 

aaacttttgc aatttggttt aagaggtttt gaagagcggg ctcgtaaagc aaaagtagct 600 

ctagatttaa cagatccagc aagtattgat aaatatcatt tttacgactc tatatttatc 660 

gtaatcgatg ctattaaagt atatgcaaag cgctttgttg ctcttgctaa aagtttagcc 720 

gaaaatgcaa atcctaaacg taagaaagaa ttacttgaga ttgcagatat ttgctctaga 780 

gtcccatatg aaccggcaac tacttttgca gaagctattc aatcagtttg gtttattcaa 840

tgtattttac aaattgaatc taatggccac tctctttcat atggccgttt tgatcaatat 900 

atgtatccat atatgaaggc tgatttagaa agtggtaaag aaacagaaga tagcattgtt 960 

gaacgtctga caaatctttg gattaagaca attacaatta ataaggttcg cagtcaatca 1020 

catacatttt cttcagcagg aagtccttta tatcaaaatg ttacaattgg tggacagact 1080 

cgagataaga aggatgctgt taacccatta tcttatttgg tattaaaatc agttgcacaa 1140 

acccatctac cgcaacctaa tctaactgta cgttaccatg caggtttaga tgctcgtttc 1200 

atgaatgagt gtattgaagt gatgaaactt ggttttggta tgcctgcatt taataatgat 1260 

gagattatta ttccttcttt tattgcaaaa ggagtattgg aagatgatgc ttatgattac 1320

agtgccattg gatgtgttga aacggcagtt ccagggaaat ggggctatcg ttgcacaggt 1380 

atgagttata tgaacttccc taaggttcta cttatcacga tgaatgatgg aattgatccg 1440

gcttcgggta aacggtttgc accaagcttt ggtcgtttta aggatatgaa gaacttttct 1500 

gaattagaaa atgcttggga taaaacacta agatatttga cacgaatgag tgltattgtt 1560

gaaaattcta ttgatttatc attggaacga gaagttcctg atattctatg ttcagcattg 1620 

actgatgatt gtattggtcg tggaaaacac cttaaagaag gtggagcagt atatgattat 1680 

atatcaggat tgcaagttgg aiattgcaaat ttgtcggatt cattagctgc aattaciaaaa 1740 

ttggtgtttg aggaagaacg tataagccca agtcagcttt ggcatgcact ggaaacagati 1800 

tatgccggag aagaaggtaa ggtcattcaa gaaatgttga ttcatgatgc acctaagtiat 1860

ggtaatgatg atgattatgc tgacaaattg gttactgctg cttatgacat ttatgttgat 1920 

gaaattgcta aatatcct:aa tacacgttat ggaagagggc ctattggagg aattcgttat 1980 

tcaggaacat cttctatctc agccaacgta gggcagggac gtggaacatt agcaactcca 2040

gatggacgca acgcgggtac accgttagca gagggttgtt caccatcaca taatatggat 2100 

caacacggcc ctacatctgt tttaaaatct gtttcaaaat taccaacaga tgaaatcgta 2160 

ggtggggttc tcttaaatca gaaagtaaat cctcaaacgt tagccaaaga agaagataaa 2220 

ttaaaactaa ttgctttgtt acgaacattc tttaatcgtt tacatgggta ccatattcaa 2280 

tacaatgttg ttitccagaga gacgctgatt gacgctcaga aacatcctga aaaacacaga 2340 

gacttaattg ttcgtgttgc aggatactct gcattcttca atgttctttc taaggcaacc 2400 

caagatgaca ttataggacg tactgagcat actttg 2436 

SeqID 14

atgtcacaag cacaatatgc aggtactgga cgtcgtaaaa acgctgttgc acgcgttcgc 60 

cttgttccag gaactggtaa aatcactgtt aacaaaaaag atgttgaaga gtacatccca 120

cacgctgacc ttcgfccttgt catcaaccaa ccattcgcag ttacttcaac tgtaggttca 180 

tacgacgttt tcgttaacgt tataggtggt ggatacgctg gtcaatcagg agctatccgt 240

cacggtatcg ctcgtgccct tcttcaagta gacccagact tccgcgattc attgaaacgc 300 

gcaggacttc ttacacgtga ctcacgtaaa gttgaacgta agaaaccagg tcttaagaaa 360 

gctcgtaaag catcacaatt tagtaaacgt 390 

SeqID 15 

ttggagaaga aactgaccat aaaagacatt gcggaaatgg ctcagacctc gaaaacaacc 60 

gtgtcatttt acctaaacgg gaaatatgaa aaaatgtccc aagagacacg tgaaaagatt 120 

gaaaaagtta ttcatgaaac aaattacaaa ccgagcattg ttgcgcgtag cttaaactcc 180 

aaacgaacaa aattaatcgg tgttttgatt ggtgatatta ccaacagttt ctcaaaccaa 240

attgttaagg gaattgagga tatcgccagc cagaatggct accaggtaat gataggaaat 300 

agtaattaca gccaagagag tgaggaccgg tatattgaaa gcatgcttct cttgggagta 360 

gacggcttta ttattcagcc gacctctaat ttccgaaaat attctcgtat catcgatgag 420 

aaaaagaaga aciatggtctt ttttgatagt cagctctatg aacaccggac tagctgggtt 480 

aaaaccaiata actatgatgc cgtttatgac atgacccagt cctgtatcga aaaaggttat 540 

gaacattttc tcttgattac agcggatacg agtcgtttga gtactcggat tgagcgggca 600 
agtggttttg tggatgcttt aacagatgct aatatgcgtc acgccagtct siaccattgaa 660

gataagcata cgaatttgga acaiaattaag gaatttttac aaaaagaaat cgatcccgat 720 

gaaaaaactc tggtatttat ccctaactgt tgggccctac ctctagtctt taccgttatc 780 

aaagagttga attataactt gccacaagtt gggttgattg gttttgacaa tacggagtgg 840 

acttgctttt cttctccaag tgtttcgacg ctggttcagc cctcctttga ggsiaggacaa 900 

caggctacaa agattttgat tgaccagatt gaaggtcgca atcaagaaga aaggcaacaa 960

gtcttggatt gtagtgtgaa ttggaaagag tcgactttc 999 

SeqID 16

atgaatasiag gattatttga aaaacgttgt aaatatagta ttcggaaatt ttcattaggt 60 

gttgcttctg ttatgattgg agctgcattc tttgggacaa gtccggttct tgcagatagc 120 

gtgcagtctg gttccacggc gaacttacca gctgatttag ctactgctct tgcaacagca 180 

aaagagaatg atgggcgtga ttttgaagcg cctaaggtgg gagaagacca aggttctcca 240

gaagttacag atggacctaa gacagaagaa gaactattag cacttgaaaa agaaaaaccg 300 

gctgaagaaa aaccasiaaga ggataaacct gcagctgcta aacctgaaiac acctaagacg 360 

gtaacccctg aatggcaaac ggtagcgaat aaagagcaac agggaacagt cactatccga 420 

gaagaaaaag gtgtccgcta caaccaacta tcctcaactg ctcaaaatga tELacgcaggc 480 

aeiaccagccc tgtttgaaaa geiagggcttg accgttgatg ccaatggaaa tgcaactgtt 540 

gatttaacc^ tcaaagatga ttctgaaaag ggcaaatcac gctttggtgt ctttttgsiaa 600 

tttaaagata ccaagaataa tgtttttgtc ggttatgaca aggatggctg gttctgggag 660 

tataaatctc caacaactag cacttiggtat agaggtagtc gtgttgctgc tcctgaaaca 720 

ggatcaacaa accgtctctc tatcactctc aagtcagacg gtcagctaaa tgccagcaat 780 

aatgatgtca atctctttga cacagtgact ctaccagctg cggtcaatga ccatcttaaa 840 

aatgagaaga agattcttct caaggcgggc tcttatgacg atgagcgaac agttgttagc 900 

gtteiaaacgg ataaccaaga gggggtaaaa acagaggata cccctgctga aaaagaaaca 960 

ggtcctgaag ttgatgatag caaggtgact tatgacacga ttcagtctaa ggtcctcaaa 1020 

gcagtgattg accaagcctt ccctcgtgtc aaggaataca gcttgaacgg gcatactttg 1080 

ccaggacagg tgcaacagtt caaccaagtc tttatcaata accaccgaat cacccctgaa 1140 

gtcacttata agaaaatcaa tgagacaaca gcagagtact tgatgaagct tcgcgatgat 1200 

gctcactteuEL tcaatgcgga aatgacagta cgcttgcaag ttgtagacaa tcaattgcac 1260 

tttgatgtga ctaagattgt caaccacaat caagtcactc caggtcaaaa gattgatgac 1320 

gaaagcSLciac tactttcttc tattagtttc ctcggcaatg ctttagtctc tgtttctagt 1380 

aatcaauictg gtgctaagtt tgatggggca accatgtcaa acaatacgca tgtcagcgga 1440 

gatgatcata tcgatgtaac caatccaatg aaggatttgg ctaagggtta catgtatgga 1500 

tttgtttcta cagataagct tgctgctggt gtttggagta actctceiaaa cagctatggt 1560 

g^tggttcga atgactggac tcgtttgaca gcttataaag siaacagtcgg siaatgccaac 1620

tatgtaggaa tccacagctc tgaatggcaa tgggaaaaag cttataaggg cattgttttc 1680

Gcagaataca cgaaggaact tccaagtgct aaggttgtta tcactgaaga tgccaatgca 1740 

gacaagsiacg ttgattggca agatggtgcc attgcttatc gtagcattat gaacaatcct 1800 

caaggttggg aaaaagltaa ggatiatcaca gcttaccgta tcgcgatgaa ctttggttct 1860

caagcacaaa acccattcct tatgaccttg gatggtatca agaaaatcaa tctccataca 1920 

gatggtcttg ggcsiaggtgt tctccttaaa ggatatggta gcgaaggcca tgactctggt 1980 

cacttgaact atgctgatat tggtaagcgt atcggtggtg tcgaagactt caagacccta 2040 

attgagaagg ctaagaaata tggagctcat ctaggtatcc acgttaacgc tbcagaaact 2100

tatcctgagt ctaaatactt caatgaaaaa attctccgta agaatccaga tggaagctat 2160 

agctatggtt ggaactggct agatcaaggt atcaacattg atgctgccta tgacctagct 2220 

catggtcgtt tggcacgttg ggaagatttg aagaaaaaac ttggtgacgg tctcgacttt 2280 

atctatgtgg acgtttgggg taatggtcaa tcaggtgata acggtgcctg ggctacccac 2340 

gttcttgcta aagaaatteia caaacaaggc tggcgctttg cgatcgagtg gggccatggt 2400 

ggtgagtacg actctacctt ccatcactgg gcagctgact tgacctacgg tggctacacc 2460 

aataaaggta tcaacagtgc catcacccgc tttatccgta accaccaaaa agatgcttgg 2520 

gtaggggact acagaagtta tggtggtgca gccaactatc cactgctagg tggctacagc 2580 

atgaaagact ttgaaggctg gcagggaaga agtgactaca atggctatgt aaccaactta 2640 

tttgcccatg acgtcatgac taagtacttc caacacttca ctgtaagtaa atgggaaaat 2700 

ggtacaccgg tgactatgac cgataacggt agcacctata aatggactcc agaaatgcga 2760 

gtggaattgg tagatgctga caataataaa gtagttgtaa ctcgtaagtc aaatgatgtc 2820 

aatagtccac aatatcgcga acgtacagta acgctcaacg gacgtgtcat ccaagatggt 2880 

tcagcttact tgactccttg gaactgggat gcaaatggta agaaactttc tactgataag 2940 

gaaaagatgt actacttcaa tacgcaggcc ggtgcaacaa cttggaccct tccaagcgat 3000 

tgggcaaaga gcaaggttta cctttacaag ctaactgacc eiaggtaagac agaagagcaa 3060 

gaactaactg tiaaaagatgg taaaabtacc ctagatcttc tagcaaatca accatacgtt 3120 

ctctatcgtt cgaaacaaac taatcctgaa atgtcatgga gtgaaggcat gcacatctat 3180 

gaccaaggat ttaatagcgg taccttgciaa cattggacca tttcaggcga tgcttctaag 3240

gcagaaattg tcaagtctca aggggcaaac gatatgcttc gtattcaagg aaacaaagaa 3300 

aaagttagtc tcactcagaa attaactggc ttgaaaccaa ataccaagta tgccgtttat 3360 

gtbggtgtag ataaccgtag taa&gccaag gcaagtatca ctgtgaatac tggtgaaaaa 3420 

gaagtgacta cbtataccaa taagtctctc gcgctcaact atgttaaggc cbacgcccac 3480 

aatacacgtc gtgacaatgc tacagttgac gatacaagtt acttcceiaaa catgtacgcc 3540 
ttctttacaa ctggagcgga cgtctcaaat gttactctga cattgagtcg tgaagctggt 3600

gatcaagcaa cttactttga tgaaattcgt acctttgaaa acetattcaag catgtacgga 3660 

gacaagcatg atacaggtaa aggcaccttc aagcaagact ttgaaaatgt tgctcagggt 3720 

atcttcccat ttgtagtggg tggtgtcgaa ggtgttgaag ataaccgcac tcacttgtct 3780 

gaaaaacaca atccatatac acaacgtggt tggaatggta agaaagtcga tgatgttiatc 3840 

gaaggaaatt ggtcactcaa gacaaatgga ctagtgagcc gtcgtaactt ggtttaccaa 3900 

accatcccac aaaacttccg ttttgaagca ggtaagacct accgtgtaac ctttgaatac 3960 

gaagcaggat cagacaatac ctatgctttt gtagtcggta agggagaatt ccagtcaggt 4020 

cgtcgtggta ctcaagcaag caacttggaa atgcatgaat tgccaaatac ttggacagat 4080 

tctaagaaag ccaagaaggc aaccttcctt gtgacaggtg cagaaacagg cgatacttgg 4140 

gtaggtatct actcaactgg aaatgcaagt aatactcgtg gtgattctgg tggaaatgcc 4200 

aacttccgtg gttataacga cttcatgatg gataatcttc aaatcgaaga aattacccta 4260 

acaggtaaga tgttgacaga ataatgctctg aagaactact tgccaacggt tgccatgact 4320 

aactacacca aagagtctat ggatgctttg aaagaggcgg tctttaacct cagtcaggcc 4380 

gatgatgata tcagtgtgga agaagcgcgt gcagagattg ccaagattga agctttgaag 4440 

aatgctttgg ttcagaagaa gacggctttg gtagcagatg actttgcaag tcttacagct 4500 

cctgctcagg ctcaagaagg tcttgcaaat gcctttgatg gcaatgtgtc tagtctatgg 4560 

catacanctt ggaatggtgg agatgtaggc aagcctgcaa ctatggtctt gaiaagaacca 4620

actgaaatca caggacttcg ctatgttccg cgtggatcag gttcaaatgg taacttgcga 4680 

gatgtgaaac ttgttgtgac agatgagtct ggcaaggagc atacctttac tgcaactgat 4740

tggccaaata acaacaaacc aaaagatatt gactttggta agacaatcaa ggctaagaaa 4800 

atLtgtcctta ctggtaccaa gacatacgga gatggtggag ataaatacca atctgcagcg 4860 

gaacttatct ttactcgtcc acaggtagca gaaacacctc ttgacttgtc aggctatgaa 4920 

gcagctttgg ttaaggctica gaaattiaaca gacaaagaca atcaagagga agtagctagc 4980

gttcaggcaa gcatgaaata bgcgacggat aaccatctct tgacggaaag aatggtggaa 5040 

tactttgcag attatctcaa ccaattaaaa gattctgcta cgaaaccaga tgctccaact 5100 

gtagagaaac ctgagtttaa acttagatct ttagcttccg agcaaggtaa gacgccagat 5160 

tataagcaag aaatagctag accagaaaca cctgaacaaa tcttgccagc aacaggtgag 5220 

agtcaatctg acacagccct catcctagca agtgttagtc tagccctatc tgctctcttt 5280 

gtagtaaaaa cgaagaaaga c 5301 

SeqID 17 

atigaacaaac caacgattct gcgcctaatc aagtatctga gcattagctt cttaagcttg 60 

gttatcgcag ccattgtctt aggcggagga gtttttttct actacgttag caaggctcct 120 

agcctatccg agagtaaact agttgcaaca acttctagta aaatctacga cauitaaaaat 180 

caactcattg ctgacttggg ttctgaacgc cgcgtcaatg cccaagctaa tgatattccc 240 

acagatttgg ttaaggcaat cgtttctatc gaagaccatc gcttcfctcga ccacaggggg 300 

attgatacca tccgtatcct gggagctttc ttgcgcaatc tgcaaagcaa ttccctccaa 360 

ggtggatcaa ctctcaccca acagttgatt aagttgactt acttttcaac ttcgacttcc 420 

gaccagacta tttctcgtaa ggctcaggaa gcttggttag cgattcagtt agaacaaaaa 480 

gcaaccaagc aagaaatctt gacctactat ataaataagg tctacatgtc taatgggaac 540 

tatggaatgc agacagcagc tcEuiaactac tatggtaaag accbcaataa tttaagttta 600

cctcagttag ccttgctggc tggaatgcct caggcaccaa accaatatga cccctattca 660 

catccagaag cagcccaaga ccgccgaaac ttggtcttat ctgaaatgaa aaatcaaggc 720 

tacatctctg ctgaacagta tgagaaagca gtcaatacac csiattactga tggactacaa 780 

agtctcaaat cagcaagtaa ttaccctgct tacatggata attacctcaa ggaagtcatc 840 

aatcaagtitg aagaagaaac aggctataac ctactcacaa ctgggatgga tgtctacaca 900

aatgtagacc aagaagctca aaaacatctg tgggatattt acaatacaga cgaatacgtt 960 

gcctatccag acgatgaatt gcaagtcgct tctaccattg ttgatgtttc taacggtaaia 1020 

gtcabtgcGC agctaggagc acgccatcag tcaagtaatg tttccttcgg aattaaccaa 1080 

gcagtagaaa caaaccgcga ctggggatca actatgaaac cgabcacaga cbatgcbcct 1140 

gccttggagt acggtgtcta cgattcaact gctactatcg ttcacgatga gccctabaac 1200 

taccctggga caaatactcc tgtttataac tgggataggg gctactttgg caacatcacc 1260 

ttgcaatacg ccctgcaaca atcgcgaaac gtcccagccg tggaaactct aaacaaggtc 1320

ggactcaacc gcgccaagac tttcctaaat ggtctaggaa tcgactaccc aagtattcac 1380 

tactcaaatg ccatttcaag taacacaacc gaatcagaca aaaaatatgg agcaagtagt 1440 

gaaaagatgg ctgctgctta cgctgccttt gcsiaatggtg gaacttacta taaaccaatg 1500 

tatatccata aagtcgtctt tagtgatggg agtgaaaaag agttctctaa tgtcggaact 1560 

cgtgccatga aggaaacgac agcctatatg atgaccgaca tgatgaaaac agtcttgact 1620 

tatggaactg gacgaaatgc ctatcttgct tggctccctc aggctggtaa aacaggaacc 1680 

tctaactata cagacgagga aattgaaaac cacatcaaga cctctcaatt tgtagcacct 1740 

gabgaactat tbgctggcba tacgcgtaaa battcaatgg cbgtatggac aggcbabtct 1800 

aaccgtcbga caccacttgt aggcaabggc cbbacggbcg ctgccaaagt ttaccgctcb 1860 

abgabgaccb acctgbctga aggaagcaab ccagaagabb ggaababacc agaggggctc 1920 

bacagaaabg gagaabbcgb abtbaaaaab ggbgctcgbb cbacgtggaa cUcaccbgcb 1980 

ccacaacaac ccccabcaac bgaaagttca agcbcabcab cagabagtbc aacbbcacag 2040 

tcbagctcaa ccacbccaag cacacuibaab agbacgacba ccaabcctaa caabeuibacg 2100 

caacaabcaa abacaacccc bgabcaacaa aabcagaabc cbcaaccagc acaacca 2157 
SeqID 18

atgagtaaaa aaagacgaaa tcgtcataaa aaagaaggtc aagaaccgca atttgatttt 60 

gatgaagcaa aagagctaac agttggtcaa gctattcgta aaaatgaaga agtggaatca 120 

ggagtcttgc ctgaggattc cattttggac aagtatgtta agcaacacag agatgaaatt 180 

gaggcggata agtttgcgac tcgtcaatac aaaaaagagg agttcgttga aactcagagt 240

ctggatgatt taattcaaga gatgcgtgag gctcftagaga agtcagaagc ttcttcggag 300 

gaagttccat cttctgaaga catcttacta cccttgcctc tggacgatga ggagcaaggc 360 

ttggatcctc tattgctaga tgatgaaaat ccaacagaaa tgactgaaga agtggaagag 420 

gagcaaaacc tttctcgtct ggatcaagag gactcagasia agaaaagtaa aaaaggcttt 480 

attttgaccg ttttggcgct tgtatcagta attatttgtg tcagtgctta ttatgtctac 540 

cgtcaagtgg ctcgttcgac taaggaaatt gaaacttctc aatcaactac agccaatcaa 600 

tcggatgtgg atgattttaa tacactttat gacgcctttt acacagatag caatasiaacg 660 

gctttgaaaa atagccagtt tgataaactg agtcaactca agactttact tgataagctg 720 

gaaggtagtc gtgaacatac gcttgccaaa tctaaatatg atagtctagc aacgcaaatc 780 

aaggctattic aagatgtcaa tgctcaattt gagciaaccag ctattgtgga tggtgtgttg 840 

gataccaatg ccaaagccaa atcggatgct aaatttacgg atattsiaaac tggaaatacg 900 

gagctbgatia aagtgctaga taaggctatc aglcttggta agagccagca aacaagtact 960

tctagctcaa gttcaagtca aactagcagc tcaagttcaa gtcaagcaag ttcaaatacg 1020 

actagtgage caaaaccaag tagttcaaat gagactagaa gtagtcgcag tgaagtcaat 1080

atgggtctct cgagtgcagg ggttgctgtt caaagaagtg ccagtcgtgt tgcctataat 1140 

cagtctgcta ttgatgatag taataactct gcctgggatt ttgcggatgg tgtcttggaa 1200 

caaattctag cgacttcacg ttcacgtggc tatatcactg gagaccaata tatccttgaa 1260 

cgtgtcaata tcgttaacgg caatggttat tacaacctct acaagccaga tggaacctat 1320 

ctictttaccc ttaactgtaa gacaggctac tbtgtcggaa atggcgctgg tcatgcggat 1380 

gacttagatt ac 1392 

SeqID 19

atgaagcttt tgaaaaaaat gatgcaagtc gcattagcag tctttttctt tggtttgcta 60

gctacaaata cggtatttgc gaataccaca ggtggccgat ttgttgataa ggataataga 120 

aaatattatg taaaagatga tcataaagca atctattggc ataaaataga cggtaaaact 180 

tactattttg gtgatattgg agagatggtt gtcggttggc aatacttaga aattcctgga 240 

acaggttatc gtgataattt attcgataac caaccagtta atgaaattgg ccttcaggag 300 

aagtggtact attttggaca agatggtgct ttgctagaac aaacagataa acaagtacta 360 

gaggcaeiaiaa cgtctgaaaa tacaggaaaa gtatacggtg aacaatatcc tctatctgct 420 

gsLaaagagaa cttattattt tgataataat tatgctgtaa agacaggctg gatttatgaa 480 

gagggtcatt ggtattattt aaataagcta ggaaattttg gcgatgattc ttacaatcca 540 

ctaccaattg gtgaagttgc taagggttgg actcaiagatt ttcatgttac tattgacatt 600 

gatagaagca aacctgctcc atiggtactac ctagatgctt caggtaagat gcttacagat 660 

tggcaaaaag taaacggaaa atggtattat tttggctcct ctggttctat ggcaacaggt 720 

tggaaatatg tacgaggcaa atggtattac ttagataata aaaatggtga tatgaaaaca 780 

ggatggcaat accttggtaa caagtggtac tacctccgtt catcaggagc tatggtaact 840 

ggctggtatc aagatggttc aacttggtac tatttagatc cttctaatgg agatatgaaa 900 

ataggttgga caaaagtaaa tggaeiaatgg tattatctca attcsiaatgg agcaatggtt 960 

acaggtagcc ciaactatcga tggtaaagtt tataatttcg cctcatctgg tgagtggatt 1020 

SeqID 20

atgaaaattt tgaaaaaaac tatgcaagtt ggactgacag batttttctt tggtttgcta 60 

gggaccagta cagtatttgc agatgattct gaaggatggc agtttgtcca agaaaacgga 120 

agaacctact acaaaaaggg ggacctcaaa gaaaccbact ggcgagtgat tgatggteiag 180 

tactattatt ttgattcbct atctggagag atggtitgtcg gctggcaata tatcccgttit 240 

ccatctaaag gtagtacaat tggtccttac ccaaatggta tcagattaga aggttttcca 300 

aagtcagagt ggtactacbt cgataaaaat ggagtgctac aagagtttgt tggttggaaa 360

acattsagaga ttcuiaactaa agacagtgtt ggaagaaagt acggggaaaa acgtgaagat 420 

tcagaagata aagaagageia gcgttattat acgaactatt actttaatca aaatcattct 480 

ttagagacag gttggcttta tgatcagtct aactggtatt atctagctaa gacggaaatt 540 

aatggagaaa actaccttgg tggtgaaaga cgtgcggggt ggataaacga tgattcgact 600 

tggtactacG tagatccaac aactggtatt atgcaaacag gttggcaata tctaggtaat 660 

aagtggtact acctccgttc ctcaggagca atggccactg gctggtatca ggaaggtacc 720 

acttggtatt atttagacca cccaaatggc gatatgaaaa caggttggca aaaccttggg 780 

etacaaatggt actatctcog ttcatcagga gctatggcaa ctggttggta tcaagatggt 840 

tcaacttggt actacctaaa tgcaggtaat ggagacatga agacaggttg gttccaggtc 900 

aatggcaact ggtactatgc ttatagctca ggtgctttgg cagtgaatac gaccgtagat 960 

ggctattctg tcaactataa tggcgaatgg gttcgg 996 

SeqID 21

atggttttat ctaagtatta tggagtagcc gatggaatga atgtagaagg gaggggaagt 60 

gcgaatttta ttaaagataa tgtgttaatt acagcggctc acaactacta cagacatgac 120 
tatgggaaag aagcggatga tatttatgtt cttccggctg ttagtccaag tcaagaacca 180

tttggsiaaga tcaaagtaaa ggsiagttcgt tatttgaagg atatttagaaa tttaaattct 240 

aaggatgcaa gggaatatga cttggcttta ttaattctag aagagcccat tggtgcaaaa 300 

ttagggactt tgggtcttcc tactagtcaa aaaaiatttga caggaataac tgtgactatc 360 

acaggctatc catcatataa ttttaaaatt catcaaatgt atacagataa aaaacaagtt 420 

ttaagtgatg atggcatgtt cttggattac caagttgata ctttagaggg gtctagtgga 480 

tctacagttt atgatgctag tcaccgtgta gtaggagtgc atactttagg agatggagct 540 

aatcaaatba acagtgcagt taaattaaat gaacgaaatt tgccatttat ttattcggtt 600 

cttaaaggtt actctcttga aggatggaag aaaataaatg gtagttggta ccattataga 660 

caacatgata aacaaacggg ttggcaggag ataaatgata cttggtatta tttagacagt 720 

tccggtaaga tgcttacaga ttggcaaaaa gtaaatggaa aatggtatta tctcaattca 780 

aiatggagcaa tggttacagg tagccaaact atcgatggta aagtttataa cttcgcttca 840 

tctggtgagt ggatt 855 

SeqID 22

ttgatgaaaa aaactttttt cttactggtg ttaggcttgt tttgccttct tccactctct 60 

gtttttgcca ttgatttcaa gataaactct tatcaagggg atttgtatat tcatgcagac 120 

aiatacggcag agtttagaca gaagatagtt taccagtttg aggaggactt taagggccaa 180 

atcgtgggac ttggacgtgc tggtaagatg cctagcgggt ttgacattga ccctcatcca 240 

aagattcagg ccgcgaaaaa cggtgcagaa ctagcagatg tgactagcga agtaacagaa 300 

gaagcggatg gttatactgt gagagtctat aatccaggtc aggagggcga catagttgaia 360 

gttgacctcg tctggaactt aaaaaattta cttttccttt atgatgatat cgctgaatta 420

ciattggcaac ctctgacaga tagttcagag tctattgaaa agtttgaatt tcatgtaagg 480 

ggagacaagg gggctgaacia actctttttc catacaggga aactttttag agagggsiacg 540 

attgaaaaga gtaaccttga ttat€u:tatc cgtttagaca atcttccggc taagcgtgga 600 

gttgagttgc atgcctattg gcctcggacc gattttgcta gcgctaggga tcagggattg 660 

aaagggaatc gtttagaiaga gtttaataag atagaagact cgattgttag agaiaaaagat 720 

cagagtaaac aactcgttac ttgggtcctc ccttcgatcc tttccatctc cttgttattg 780 

agtgtctgct tctattttat ttatagaaga aagaccactc cttcagtcaa atatgccaaa 840 

aatcatcgtc tctatgaacc accaatggaa ttagagccta tggttttatc agaagcagtc 900

tactcgacct ccttggagga agtgagtccc ttggtcaagg gagctggaaa attcaccttt 960 

gatcsiactta ttcaagctac cttgctagat gtgatagacc gtgggaatgt ctctatcatt 1020 

tcagaaggag atgcagttgg tttigaggcta gtaaaagaag atggtttgtc aagctttgag 1080

aaagactgcc taaatctagc tttttcaggt aaaaaagaag aaactctttc caatttgttt 1140 

gcggattaca aggtatctga tagtctttat cgtagagcca aagtttctga tgaaaaacgg 1200 

attcaagcaa gagggcttca actcataatct tcttttgaag aggtattgaa ccagatgcaa 1260 

gaaggagtga gaaaacgagt ttccttctgg gggctcccag attattatcg tcctttaact 1320 

ggtggggaaa aggccl^tgca agtgggtatg ggtgccbtga ctatcctgcc cctatttatc 1380 

ggatttggtt tgttcttgta cagtttagac gttcatggct atctttacct ccctttgcca 1440 

atacttggtt ttctagggtt agttttgtct gttttctatt attggaagct tcgactagat 1500 

aatcgtgatg gtgttctaaa tgaagcggga gctgaggtct actatctctg gaccagtttt 1560 

gaaaatatgt tgcgtgagat tgcacgattg gatcaggctg aactggaaag tattgtggtc 1620 

tggaatcgcc tcttggtcta tgcgacctta tttggctatg cggacaaggt tagtcatttg 1680 

atgaagg&tc atcagattca agtiggaaaat ccagatatca atctctatgt agcttatggc 1740 

tggcacagta cgttttatca ttcaacagca caaatgagcc attatgctag tgtcgcaaat 1800 

acagcetagca cctactctgt atcttctgga agtggaagtt ctggtggtgg cttctctgga 1860 

ggcggaggtg gcggcagtat cggtgccttt 1890 

SeqID 23

atgaaatcaa tcaacaaatt tttaacaatg cttgctgcct tattactgae agcgagtagc 60 

ctgttttcag ctgcaacagt ttttgcggct gggacgacaa caacatctgt taccgttcat 120 

aaacbattgg caacagatgg ggatatggat aaaattgcaa atgagttaga aacaggtaac 180 

tatgctggta ataaagtggg tgttctacct gcaaiatgcaa siagaaattgc cggtgttatg 240 

ttogtttgga caaatactaa taatgaaatt attgatgaaa atggccaaac tctaggagtg 300 

aatattgatc cacaaacatt taaactctca ggggceiabgc cggcaactgc aatgaaaaaa 360 

btaacagaag ctgaaggagc taaatttaac acggceuiatt taccagctgc taagtataaa 420 

atttatgaaa ttcacagttt atcaacttat gtcggtgaag atggagcaac cttaacaggt 480 

actaaagcag ttccaattga aatbgaatta ccabbgaacg abgbbgbgga bgcgcabgbg 540 

^abccaaaaa abacagaagc aaagccaaaa atbgabaetag abbbcaaagg baaagcaaab 600 

3cagabacac cacgbgbaga baaagabaca ccbgbgaacc accaagbbgg agabgbbgba 660 

jagbacgaaa bbgbbacaaa aabbccagca cbbgcbaabb abgcaacagc aaacbggagc 720 

fabagaabga cbgaaggbbb ggcabbcaac aaaggbacag bgaaagbaac bgbbgabgab 780 

rbbgcacbbg aagcaggbga bbabgcbcba acagaagbag caacbggbbb bgabbbgaaa 840 

:baacagabg cbggbbbagc baaagbgaab gacceiaaacg cbgaaaaaac bgbgaaaabc 900 

icbbabbcgg caacabbgaa bgacaaagca abbgbagaag baccagaabc taatgabgba 960 

icabbbaacb abggbeiabaa bccagabcac gggaabacbc caaagccgaa taagccaaab 1020 

raaaacggcg abbbgacabb gaccaagaca bgggbbgabg cbacaggbgc accctabbccg 1080 

fcbggagcbg aagcaacgbb cgabbbggbb aabgcbcaga cbggbaaagb bgbacaaacb 1140 
gtaactttga caacagacaa aaatacagtt actgttaacg gattggataa aaatacagaa 1200

tataaattcg ttgaacgtag tataaaaggg tattcagcag attatcaaga aatcactaca 1260 

gctggagaaa ttgctgtcaa gaactggaaa gacgaaaatc caaaaccact tgatccaaca 1320 

gagccaaaag ttgttacata tggtaaaaag tttgtcaaag ttaatgataa agataatcgt 1380 

ttagctgggg cagaatttgt aattgcaaat gctgataatg ctggtcaata tttagcacgt 1440 

aaagcagata aagtgagtca agaagagaag cagttggttg ttacaacaaa ggatgcttta 1500 

gatagagcag ttgctgctta taacgctctt actgcacaac aacaaactca gcaagaaaaa 1560 

gagaaagttg acaaagctca agctgcttat aatgctgctg tgattgctgc caacaatgca 1620 

tttgaatggg tggcagataa ggacaatgaa aatgttgtga aattagtttc tgatgcacaa 1680 

ggtcgctttg aaattacagg ccttcttgca ggtacatatt acttagaaga aacaaaacag 1740 

cctgctggtt atgcattact aactagccgt cagaaatttg aagtcactgc aacttcttat 1800 

tcagcgactg gacaaggcat tgagtatact gctggttcag gtaaagatga cgctacaaaa 1860 

gtagtcaaca aaaaaatcac tatcccacaa acgggtggta ttggtacaat tatctttgct 1920 

gtagcggggg ctgcgattat gggtattgca gtgtacgcat atgttaaaaa caacaaagat 1980 

gaggatcaac ttgct 1995 

SeqID 24 

atggcggtaa tggcgtatcc gctggtgtct cgcttgtatt atcgagtgga atcaaatcaa 60 

caaattgctg actttgatiaa ggaaaaagca acgttggatg aggctgacat tgatgaacga 120 

atgaaattgg cacaagcctt caatgactct ttgaataatg tagtgagtgg cgatccttgg 180 

tcggaagaaa tgaagaaaaa agggcgagca gagtatgcac gtatgttaga aatccatgag 240 

cggatggggc atgtggaaat ccccgttatt gacgtggatt tgccggttta tgctggtact 300 

gctgaagagg tattgcagca aggggctggg catctagagg gaacttctct gccgatcgga 360 

ggcaattcga cccatgcggt gattacggca catacaggtt tgccaacagc taagatgttt 420 

acggatttga ccaaacttaa agttggggat aagttttatg tgcacaatat caaggaagtg 480 

atggcctatc aagtggatca agtaaaggtg attgagccga cgaactttga tgatttattg 540 

attgtaccag gtcatgatta tgtgaccttg ctgacttgta cgccatacat gatcaatacc 600 

catcgtctat tggttcgggg gcatcggata ccgtacgtag cagaggttga ggaagaattt 660 

attgcagcaa acastactcag tcatctctat cgctacctgt tttatgtggc agttggtttg 720 

attgtgattc ttttatggat tattcgacgc ttgcgcaaga agaaaaaaca accggaaaag 780 

gctttgaagg cgctgaaagc agcaaggaag gaagtgaagg tggaggatgg acaacag 837 

SeqID 25 

atgtcaagga ctaaactacg agccttattg ggatacttgt tgatgttggt agcctgtttg 60 

attcctattt attgttttgg acagatggtg ttgcagtctc ttggacaggt gaaaggtcat 120 

gctacatttg tgaaatccat gacaactgaa atgtaccaag aacaacagaa ccattctctc 180 

gcctacaatc aacgcttggc ttcgcaaaat cgcattgtag atcctttttt ggcggaggga 240 

tatgaggtca attaccaagt gtctgacgac cctgatgcag tctatggtta cttgtctatt 300 

ccaagtttgg aaatcatgga gccggtttat ttgggagcag attatcatca tttagggatg 360 

ggcttggctc atgtggat^gg tacaccgctg cctctggatg gtacagggat tcgctcagtg 420 

attgctgggc accgtgcaga gccaiagccat gtctttttcc gccatttgga tcagctaaaa 480 

gttggagatg ctctttatta tgataatggc caggaaattg tagaatatca gatgatggac 540 

acagagatta ttttaccgtc ggaatgggaa aaattagaat cggttagctc taaaaatatc 600 

atgaccttga taacctgcga tccgattcct acctttaata aacgcttatt agtgaatttt 660 

gaacgagtcg ctgtttatca aaaatcagat ccacaaacag ctgcagttgc gagggttgct 720 

tttacgaaag aaggacaatc tgtatcgcgt gttgcaacct ctcaatggtt gtaccgtggg 780 

ctagtggtac tggcatttct gggaiatcctg tttgttttgt ggeiagctagc acgtttacta 840 
cgagggaaa 

SeqID 26 

atgaagaatc cattttttga aagacgttgt cgttacagta ttcgtaagtt atcagtagga 60 

gcctgctcgc tgatgattgg tgctgtttta tttgctggtc cagccttggc tgaagaaact 120 

gcagttcctg aaaatagcgg agctaataca gagcttgttt caggagagag tgagcattcg 180 

accaatgaag ctgataagca gaatgaaggg gaacatgcta gagaaaacaa gctagaaaag 240 

gcagaaggag tagcgatagc atctgaaact gcttcgccag caagcaatga agctgcaact 300 

actgaaactg cagaagcagc tagcgcagct aaaccagagg aaaaagcaag tgaggtggtt 360 

gcagaaacac catctgcaga agcaaaacct aagtctgaca aggaaacaga agcaetagccc 420 

gaagcaacta accaagggga tgagtctaaa ccagcagcag aagctaataa gactgaaaaa 480 

gaagtccagc cagatgtccc taaaaataca gaaaaaacat taaaaccaaa ggaaatcaaa 540 

tttaattctt gggaagaate gttaaaatgg gaaccaggtg ctcgtgaaga tgatgctatt 600 

aaccgcggat ctgttgtcct cgcttcacgt cggacaggtc atttagtcaa tgaaaaagct 660 

agcEiaggaag caaaagttca agccttatca aacaccaatt ctaaagcaaa agaccatgct 720 

tctgttggtg gagaagagtt caaggcctat gcttttgact attggcaata tctagattca 780 

atggtcttct gggaaggtct cgtaccaact cctgacgtta ttgatgcagg tcaccgtaac 840 

ggggttcctg tatacggtac actcttcttc aactggtcta atagtattgc agatcaagaa 900 

agatttgctg aagctttgaa gcaagacgca gatggtagct tcccaattgc ccgtaaattg 960 

gtagacatgg ccaagtatta tggctatgat ggctatttca tcaaccaaga aacaactgga 1020 

gatttggtta aacctcttgg agaaaagatg cgccagttta tgctctatag caaggaatat 1080 
gctgctaagg taaaccatcc aatcaagtat tcttggtacg atgccatgac ctataactat 1140 

ggacgttatc atcaagatgg tttgggagaa tacaactacc aattcatgca accagaagga 1200 

gataaggttc cggcagataa cttctttgct aactttaact gggataaggc taaaaatgat 1260 

tacactattg Cctactgccaa ctggattggt cgtaatcctt atgatgtatt tgcaggtttg 1320 

gaattgcaac agggtggttc ctacaagaca aaggttaagt ggaatgacat tttagacgaa 1380 

aatgggaaat tgcgcctttc tcttggttta tttgccccag ataccattac aagtttagga 1440 

aaaactggtg aagattatca taaaaatgaa gatatcttct ttacaggtta tcaaggagac 1500 

cctactggcc aaaaaccagg tgacaaagat tggtatggta ttgctaacct agttgcggac 1560 

cgtacgccag cggtaggtaa tacttttact acttctttta atacaggtca tggtaaaaaa 1620 

tggttcgtag atggtaaggt ttctaaggat tctgagtgga attatcgttc agtatcaggt 1680 

gttcttccaa catggcgctg gtggcagact tcaacagggg aaaaacttcg tgcagaatat 1740 

gattttacag atgcctataa tggcggaaat tcccttaaat tctctggtga tgtagccggt 1800 

aagacagatc aggatgtgag actttattct actaagttag aagtaactga gaagaccaaa 1860 

cttcgtgttg cccacaaggg aggaaaaggt tctaaagttt atatggcatt ctctacaact 1920 

ccagactaca aattcgatga tgcagatgca tggaaagagc taaccctttc tgacaactgg 1980 

acaaatgaag aatttgatct tagctcacta gcgggtaaaa ccatctatgc agtcaaacta 2040 

tttttcgagc atgaaggtgc tgtaaaagat tatcagttta acctaggaca attaactatc 2100 

tcggacaatc accaagagcc acaatcgccg acaagctttt ctgtagtgaa acaatctctt 2160 

aaaaatgccc aagaagcgga agcagttgtg caatttaaag gcaacaagga tgcagatttc 2220 

tatgaagttt atgaaaaaga tggagacagc tggaaattac taactggctc atcttctaca 2280 

actatttatc taccaaaagt tagccgctca gcaagtgctc agggtacaac tcaagaactg 2340 

aaggttgtag cagtcggtaa aaatggagtt cgttcagaag ctgcaaccac aacctttgat 2400 

tggggtatga ctgtaaaaga taccagccta ccaaaaccac tagctgaaaa tatcgttcca 2460 

ggtgcaacag ttattgatag tactttccct aagactgaag gtggagaagg tattgaaggt 2520 

atgttgaacg gtaccattac tagcttgtca gataaatggt cttcagctca gttgagtggt 2580 

agtgtggata ttcgtttgac caagccacgt accgttgtta gatgggtcat ggatcatgca 2640 

ggagctggtg gtgagtctgt taacgatggc ttgatgaaca ctaaagactt tgacctttat 2700 

tataaagatg cagatggtga gtggaagcta gctaaggaag tccgtggtaa caaagcacac 2760 

gtgacagata tcactcttga taaaccaatc actgctcaag actggcgctt gaatgttgtc 2820 

acttctgaca atggaactcc atggaaggct attcgtatct ataactggaa aatgtatgaa 2880 

aagcttgata ctgagagtgt caatattccg atggccaagg ctgcagcccg ttctctaggc 2940 

aataacaagg tacaagttgg ctttgcagat gtaccggctg gagcaactat taccgtttat 3000 

gataatccaa attctcaaac tccgctcgca accttgaaga gcgaagttgg aggagaccta 3060 

gcaagtgcac cattggattt gacaaatcaa tctggtcttc tttattatcg tacccagttg 3120 

Gcaggcaagg aaattagtaa tgtcctagca gtttccgttc caaaagatga cagaagaatc 3180 

aagtcagtca gcctagaaac aggacctaag aaaacaagct acgccgaagg ggaggatttg 3240 

gaccttagag gtggtgttct tcgagttcag tatgaaggag gaactgagga cgaactcatt 3300 

cgcctaactc acgcaggtgt atcagtatca ggttttgata cgcatcataa gggagaacag 3360 

aafccttactc tccaatattt gggacaaccg gtaaatgcta atttgtcagt gactgtcact 3420 

ggccaagacg aagcaagtcc gaaaactatt ttgggaattg aagtaagtca ggaaccgaaa 3480 

aaagattacc tagttggtga tagcttagac ttgtctgaag gacgctttgc agtggcttat 3540 

agcaatgaca ccatggaaga acattccttt actgatgagg gagttgaaat ttctggttac 3600 

gatgctcaaa agactggtcg tcaaaccttg acgcttcatt accaaggcca tgaagttagc 3660 

tttgatgttt tggtatctcc aaaagcagca ttgaacgatg agtacctcaa acaaaaatta 3720 

gcagaagttg aagctgctaa gaacaaggtg gtctataact ttgcttcatc agaagtaaaa 3780 

gaagccttct tgaaagcaat tgaagcggcc gaacaagtgt tgaaagacca tgaaactagc 3840 

acccaagatc aagtcaatga ccgacttaat aaattgacag aagctcataa agctctgaat 3900 

ggtcaagaga aatttacgga agaaaagaca gagcttgatc gcttaacagg tgaggttcaa 3960 

gaactcttgg ctgccaaacc aaaccatcct tcaggttctg ccctagctcc gcttcttgag 4020 

aaaaacaagg ccttggttga aaaagtagat ttgagtccag aagagcttac aacagcgaaa 4080 

cagagtctaa aagatctggt tgctttattg aaagaagaca agccagcagt cttttctgat 4140 

agtaaaacag gtgttgaagt acacttctca aataaagaga agactgtcat caagggtttg 4200 

aaagtagagc gtgttcaagc aagtgctgaa gagaagaaat actttgctgg agaagatgct 4260 

catgtctttg aaatagaagg tttggatgaa aaaggtcaag atgttgatct ctcttatgct 4320 

tctattgtga aaatcccaat tgaaaaagat aagaaagtta agaaacrtatt tttcttacct 4380 

gaaggcaaag aggcagtaga attggctttt gaacaaacgg atagtcatgt tatctttaca 4440 

gcacctcact ttactcatta tgcctttgtt tatgaatctg ctgaaaaacc acaacctgct 4500 

aaaccagcac cacaaaacac agtccttcca aaacctactt atcaaccgac ttctgatcaa 4560 

caaaaggctc ctaaattgga agttcaagag gaaaaggttg ccttfccatcg tcaagagcat 4620 

gaaaatactg agatgctagt tggggaacaa cgagtcatca tacagggacg agatggactg 4680 

ttaagacatg tctttgaagt tgatgaaaac ggtcagcgtc gtcttcgttc aacagaagtc 4740 

atccaagaag cgattccaga aattgttgaa attggaacaa aagtaaaaac agtaccagca 4800 

gtagtagcta cacaggaaaa accagctca^ aatacagcag ttaaatcaga agaagcaagc 4860 

aaacaattgc caaatacagg aacagctgat gctaatgaag ccctaatagc aggcttagcc 4920 

agccttggtc ttgctagttt agccttgacc ttgagacgga aaagagaaga taaagat 4977 

SeqID 27 

atgtcaatta catcatttgt aaaaagaatt caagatatca ctcgaaacga tgctggtgtt 60 






aatggtgatg ctcaacgtat tgagcaaatg tcttggttat tattcttaaa aatttatgat 120 

agccgtgaaa tggtttggga attagaagaa gacgagtatg agtcaattat cccagaggaa 180 

ttaaaatggc gaaattgggc tcatgctcaa aatggggaac gggtattgac aggcgatgaa 240 

ttacttgatt ttgtcaataa caagttattc aaagagttga aagagcttga aataacttca 300 

aatatgccta ttcgaaaaac gattgttaaa tcagcttttg aagatgcsgaa caactatatg 360 

aaaaatggcg tcttgttacg ccaagtcatc aatgttattg atgaagttga tttcaatagc 420 

cctgaagatc gtcattcgtt taatgatatt tacgaaaaaa ttcttaaaga tattcaaaat 480 

gctgggaact caggagaatt ttatacgcca cgtgcagcga ctgattttat tgccgaagtt 540 

cttgacccaa aacttggaga atcaatggca gaccttgctt gcggaacagg aggcttcttg 600 

acttcgactc tgaaccgttt aagtagtcaa cgtaaaacta gtgaagatac caaaaaatat 660 

aatacagctg tttttggtat tgaaaagaaa gcatttcctc atcttttagc agttacaaat 720 

ctgtttcttc acgaaattga tgaccctaaa attgttcatg gaaatacttt ggagaaaaat 780 

gttcgtgaat atacggatga tgaaaaattt gacattatta tgatgaatcc accttttgga 840 

gggtcagaat tagaaacaat aaaaaataac tttccagcag aattacggag ttctgaaaca 900 

gctgatttat ttatggctgt cattatgtat cgtttgaaag aaaatggtcg tgttggagtt 960 

attttacctg atggttttct atttggtgaa ggtgtaaaaa ctcgcttgaa acaaaaactg 1020 

gtagatgagt tcaacttgca tacgattatt aggttgcctc atagtgtctt tgcaccgtat 1080 

acaggaatcG atacgaacat tcttttcttt gataaaacaa agaaaacaga agaaacttgg 1140 

ttttafccgtt tagatatgcc agatggttat aaaaatttct cgaaaactaa gccgatgaag 1200 

bcagaacact tcaatcctgt tcgtgactgg tgggaaaatc gtgaagagat tctggaaggt 1260 

aagttctaca aatctaaatc atttacacct agtgsiattgg ctgagttgaa ttataattta 1320 

gaccagtgtg actttccaaa agaggaagag gaaatcttaa atccctttga gttgattcag 1380 

aattatcaag cggaaagagc aactttaaat cataagattg ataatcftatt agctgatatt 1440 

ttgcagttgt tggaggacaa a 1461 

SeqID 28 

atgaacaata ctgaatttta tgatcgtctg ggggtatcca aaaacgcttc ggcagacgaa 60 

atcasiaaagg cttatcgtaa gctttccaaa aaatatcacc cagatatcaa caaggagcct 120 

ggtgctgagg acaagtacaa ggaagttcaa gaagcctatg agactttgag tgacgaccaa 180 

aaacgtgctg cctatgacca gtatggtgct gcaggcgcca atggtggttt tggtggagct 240 

ggtggtttcg gcggtttcaa tggggcaggt ggcttcggtg gttttgagga tattttctca 300 

agtttcttcg gcggaggcgg ttcttcgcgc aiatccaaacg ctcctcgcca aggagatgat 360 

ctccagtatc gtgtcaattt gacctttgaa gaagctatct tcggaactga gaaggaagtt 420 

aagtatcatc gtgaagctgg ctgtcgtaca tgtaatggat ctggtgctaa gccagggaca 480 

agtccagtca cttgtggacg ctgtcatggc gctggtgtca ttaacgtcga tacgcagact 540 

cctcttggta tgatgcgtcg ccaagtaacc tgtgatgtct gtcacggtcg aggaaaagaa 600 

atcaaatatc catgtacaac ctgtcatgga acaggtcatg agaiaacaagc tcatagcgta 660 

catgtgaaaa tccctgctgg tgtggaaaca ggtcaacaaa ttcgcctcgc tggtcaaggt 720 

gaagcaggct ttaacggtgg accttatggt gacttgtatg tagtagtttc bgtggaagct 780 

agcgacaagt ttgaiacgtga aggaacgact abcttctaca atctcaacct caactttgtc 840 

caagcggctc ttggtgatac agtagatatt ccaactgttc acggtgatgt tgaattggtt 900 

attccagagg gaactcagac tggtaaggiag ttccgcctac gtagbaaggg ggcaccgagc 960 

cttcgtggcg gtgcagttgg tgaccaatac gttactgtta atgtcgtaac accgacaggc 1020 

btgaacgacc gccaaaaagt agccttgaaa gsLattcgcgg ctgctggtga cttgaaagta 1080 

aatccaaaga aaaaaggctt ctttgaccat attaaagatg cctttgatgg agaa 1134 

SeqID 29 

atgaiatccta atctttttag aagcgtcgag ttttatcaga gacgttacca taactatgcg 60 

acagtgttaa ttatacctct ttcattacta tttactttca tcttgatttt ctcccttgtt 120 

gccacaaaag aaattactgt tacttcccaa ggagaaatcg cccctacaag tgtcattgcc 180 

tccattcagt caaccagtga tatatcctatc ctagctaatc atttagtggc aaatcaagta 240 

gttgaeuaaag gggacttact catcaaatac bctgcUiacaa tggaagaaag tcagaaaact 300 

gccttagcaa ctcaattaca aagacttgag aagcaaaaag aaggacttgg aattttgaaa 360 

caaagcttag aaaaagcgac tgatcttttt tctggcgagg atgaatttgg ctaccataat 420 

acctbtatga attttactaa acaatcccat gatattgaac tgggtatcac aaagactaac 480 

accgaagttt caaatcaagc taatctttcc aatagcagtt catcagctat tgsiacaagaa 540 

attacaaaag ttcaacaaca aattggagaa tatcaagagt tgagagatgc tatcatasiab 600 

aacagagcac gcttaccaac tggcsiatccg caccagtcaa ttttgeiatcg ttatcttgta 660 

gcctcacaag gacaaacaca aggaactgca gaggagccat ttttatctca aattaatcaa 720 

agtattgcag gtcttgaiatc atctatcgca agcctcaaaa ttcagcaagc tggtatcgga 780 

agtgtagccia cttatgataa cagtttagca accaaaattg aagtactccg cactcagttt 840 

ttacagacag cctcacagca acaactaact gtggagaatc aattaacaga attaaaagta 900 

caactagabc aagccacaca gcgtttggaa aacaiatacct taacctcccc aagtauLaggt 960 

atcgttcatc tgaacagcga atttgaaggt aaaaatagaa ttccaactgg tacagaaatt 1020 

gctcasiatab tccctgtcat cacagataca agagaagtac taatcactta ctacgtatct 1080 

tctgactatc tacctctact agataaagga caaactgtaa gattaaaact ggagaagatt 1140 

ggaaatcacg gcaccaccat catcggccaa cttcagacaa ttgatcaaac tcctaccaga 1200 

acagagcaag gaaatctctt taaiattaacc gctcttgcaa aactatctaa cgaggatagt 1260 
aaactcatcc aatatggctt acaaggtcgc gtcactagtg t:aactacciaa gaaaacatat 1320 

tttgattatt tcaaiagataa aattttaaca cattctgat 1359 

SeqID 30 

atgtcaaaga aactcaatcg taaaaaacaa ttacgaaatg gcctccgtcg cgcaggtgcc 60 

ttttcaagt:a cggtgacteia ggttgtagat gagacaaaaa aagtcgtgaa gcgtgcagaa 120 

cagtcagcaa gcgcagctgg taaggctgtt tctaaaaaag ttgaacaagc agtagaagct 180 

accaaagagc aagcticaaaa agtiagctaiat tctgtagaag attttgcagc aaatttgggt 240 

ggacttccac ttgatcgtgc caagactttc tatigatgaag gaatcaagtc tgctl:cagat 300 

ttcaaaaact ggactgaaaa agaactcctt gccttgaaag gsiatcggccc agctaccatc 360 

aagaaattga siagaaaatgg catcsiagttc aag 393 

SeqID 31 

ttgattagcc ttttcggcct tgctgctgcc aaaccagtcc aggctgatac aagtatcgca 60 

gacattcaaa aaagaggcga actiggttgtc ggtgtcasiac aagacgttcc caattttggt 120 

tacaaagatc ccaagaccgg tacttattct ggtatcgaaa ccgacttggc caagatggta 180 

gctgatg8iac tcaaggtcaa gattcgctat gtgccggtta cagcacaaac ccgcggcccc 240 

cttctagaca atgaacaggt cgatatggat atcgcgacct ttaccatcac ggacgaacgc 300 

aaaaaactct acaactttac cagtccctac tacacagacg cttctggatt tttggtcaat 360 

aeiatctgcca aaatcaaaaa gat^tgaggac ctaaacggca aaaccatcgg agtcgccccua 420 

ggttctatca cccaacgcct gattactgaa ctgggtaaaa agaaaggtct gaagtttaaa 480 

ttcgtcgaac ttggttccta cccagaattg attacttccc tgcacgctca tcgtatcgat 540 

acctttt^ccg ttgaccgctc tatitctatct ggctacacta gtaaiacggac agcactacta 600 

gatgatagtt tcaagccatc tgactacggt attgttacca agaaatcaaa tacagagctc 660 

aacgactatc ttgataactt ggttactaaa tggagcaagg atggtagttt gcagaaactt 720 

tabgaccgtt acaagctcaa accatctagc catactgcag at 762 

SeqID 32 

atgagtaata tcagtttaac aacacttggt ggtgtgcgtg agaatggaaa aaatatgtac 60 

attgctgaaa ttggagagtc catttttgtt ttgaatgtag ggttaaaata tcctgaaaat 120 

gaacaattag gggtcgabgt ggtgattcca aacatggatt acctttttga aaatagcgac 180 

cgtattgctg gggttttctt gacccacggg catgcggatg ccsattggtgc tctaccgtat 240 

ctcttggcag aggctaaagt tcctgtattt gggtctgagt tgaccattga gttggcaaag 300 

ctctttgtca aaggaaatga tgccgttaag aaatttaatg atttccatgt cattgatgag 360 

eiatacggaga ttgattttgg tgggacagtg gtttccttct tccctacgac ttactccgtt 420 

Gcagagagtc tgggaattgt cbtgaagaca tcggeuiggaa gcatcgttta tacaggbgac 480 

ttcaaatttg accaaacggc tagtgaatct tatgcaactg attttgctcg tttggcagag 540 

attggtcgtg acggcgtcct ggctctcctc agtgattcgg ccaatgcaga cagcaatatt 600 

caggt.ggct.a gtgaaagbga agttagggat gaaattaccc aaactatbgc tgactgggaa 660 

ggtcgtatca tcgttgcagc tgtttccagt aatctbtcbc gtattcagca gatttbtgac 720 

gcbgcggata aaacaggtcg acgbatcgbc bbgacaggab tbgababbga aaababcgbc 780 

cgcacagcga ttcgtcttaa gaagttgtct tbagccaacg siaabtcbbtt gatbaagccb 840 

aaagababgb ctcgcbttga agaccabgag bbgabbabbc ttgagacagg tcgtatgggb 900 

gagccbatca abggacbtcg taagabgbcg abbggbcgcc abcgbbabgb agasuibcaiag 960 

gatggggacc bagbcbatab tgctacggct ccgtcbatbg cbaaagaagc cbbbgbbgcg 1020 

cgbgtggaaa ababgatbba tcaggcaggt ggggbbgtca aabbgatbac ccaaagttba 1080 

catgtatcag ggcacggaaa bgbgcgtgat tbgcagctga bgabcaabcb ttbgceuicct 1140 

aagtacctct tccctgtcca aggggagtat cgtgagttgg atgctcacgc taaggctgcc 1200 

atggcagttg ggatgttgcc agaacgcatc ttcattccta aaaaggggac gaccatggct 1260 

tacgagaatg gagactttgt tccagctgga tcggtttcag caggagatat cttgattgat 1320 

gggaatgcca ttggtgatgt tggaaatgtt gttcttcgtg accgtaaggt cttgtcagag 1380 

gatggaattt tcatcgtggc tattacagtc aaccgtcgtg agaagaaaat tgtggctagg 1440 

gctcgtgttc acacgcgtgg atttgtttat ctcaagaaga gtcgcgatat tctccgtgaa 1500 

agttcagaat tgattaacca aacggtagaa gagtatcttc aaggagatga ctttgactgg 1560 

gcagatctca aaggtaaggt tcgtgacaat ctgaccaagt acctctttga tcaaaccaag 1620 

cgbcgeccag ccabtbbacc agbagbcabg gsiagcaaaa 1659 

SeqID 33 

atgaaaaaaa gtacagtatt gtcattaacc acagctgcag ttattttagc agcctatgcc 60 

cctaatgagg tagtcttagc agacacatct agctctgaag atgctttaaa catctctgat 120 

aaagaaaaag tagcagaaaa taaagagaaa catgaaaata tccatagtgc tatggaaact 180 

tcacaggatt ttaaagagaa gaaaacagca gtcattaagg aaaaagaagt tgttagtaaa 240 

aatcctgtga tagacaataa cactagcaat gaagaagcaa aaatcaaaga agaaaattcc 300 

aataaatccc aaggagatta tacggactca tttgtgaata aaaacacaga aaatcccaaa 360 

aaagaagata aagttgtcta tattgctgaa tttaaagata aagaatctgg agaaaaagca 420 

atcaaggaac tatccagtct taagaataca aaagttttat atacttatga tagaattttt 480 

aacggtagtg ccatagaaac aactccagat aacttggaca aaattaaaca aatagaaggt 540 

atbbcatcgg bbgaaagggc acaaaaagbc caacccabga bgaabcabgc cagaeiaggaa 600 
attggagttg aggaagctat tgattaccta aagtctatca atgctccgtt tgggaaaaat 660 

tttgatggta gaggtatggt catttcciaat atcgatactg gaacagatta tagacataag 720 

gctatgagaa tcgatgatga tgccaaagcc tcaatgagat ttaaaaaaga agacttaaaa 780 

ggcactgata aaaattattg gttgagtgat aaaatccctc atgcgttcaa ttattataat 840 

ggtggcaaaa tcactgtiaga aaaat^atgat gatggaaggg attattttga cccacatggg 900 

atgcatattg cagggattct tgctggaaat gatactgaac aagacatcaa aaactttaac 960 

ggcatagatg gaattgcacc taatgcacaa attttctctt acaaaatgta ttctgacgca 1020 

ggatctgggt ttgcgggtga tgaaacaatg tttcatgcta ttgaagattc tatcasiacac 1080 

aacgttgatg ttgtttcggt atcatctggt tttacaggaa caggtcttgt aggtgagaaa 1140 

tattggcaag ctattcgggc attaagaaaa gcaggcattc caatggttgt cgctacgggt 1200 

aactatgcga cttctgcttc aagttcttca tgggatttag tagcaaataa tcatctgaaa 1260 

atgaccgaca ctggaaatgt aacacgaact gcagcacatg aagatgcgat agcggtcgct 1320 

tctgctaaaa atcaaacagt tgagtttgat aaagttaaca taggtggaga aagttttaaa 1380 

tacagaaata taggggcctt tttcgataag agtaaaatca caacaaatga agatggaaca 1440 

aaagctccta gtaaattaaa atttgtatat ataggcaagg ggcaagacca agatttgata 1500 

ggtttggatc ttaggggcaa aattgcagta atggatagaa tttatacaaa ggatttaaaa 1560 

aatgctttta siaaaagGtat ggataagggt gcacgcgcca ttatggttgt aaatactgta 1620 

aattactaca atagagataa ttggacagag cttccagcta tgggatatga agcggatgaa 1680 

ggtactaaaa gtcciagtgtt ttcaatttca ggagatgatg cftgtaaagct atggaacatg 1740 

attaatcctg ataaaaaaac tgaagtcaaa agaaataata aagaagattt taaagataaa 1800 

ttggagcaat actatccaat tgatatggaa agttttaatt ccaacaaacc gaatgtaggt 1860 

gacgaaaaag agattgactt taagtttgca cctgacacag acaaagaact ctataaagaa 1920 

gatatcatcg ttccagcagg atctacatct tgggggccaa gaatagattt acttttaaaa 1980 

cccgatgttt cagcacctgg taaaaatatt aaatccacgc ttaatgttat taatggcaaa 2040 

tcaacttatg gctatatgtc aggaactagii atggcgactc Cciatcgtggc agcttctact 2100 

gttttgatta gaccgaaatt aaaggaaatg cttgaaagac ctgtattgaa aaatcttaag 2160 

ggagatgaca aaatagatct tacaagtctb acaaaaattg ccctacaaaa tactgcgcga 2220 

cctatgatgg atgcaacttc tl^ggaaagaa aaaagtcaat actttgcatc acctagacaa 2280 

cagggagcag gcctaattaa tgtggccaat gctttgagaa atgaagttgt agcaactttc 2340 

aacuiacactg attctaaagg tttggtaaac tcatatggtt ccatttctct taaagaaata 2400 

aaaggtgata aaaaatactt tacaatcaag cttcacaata catcaaacag acctttgact 2460 

tttaaagttt cagcatcagc gataactaca gattctctaa ctgacagatt aaaacttgat 2520 

gaaacatata aagatgaaaa atctccagat ggtaagcaaa ttgttccaga aattcaccca 2580 

gaasiaagtca aaggagcaaa tatcacattt gagcatgata ctttcactat aggcgcaaat 2640 

bctagctttg atttgaatgc ggttataaat gttggagagg ccaaaaacaa aaataaattt 2700 

gtagaatcat ttattcattt tgagbcagtg gaagaaatgg aagctctaaa ctccaacggg 2760 

aagauataa acttccsiacc ttctttgtcg atgcctctaa tgggatttgc tgggaattgg 2820 

aaccacgaac caatccttga tsuiatsgggct tgggaagaag ggtcaagatc aaaaacactg 2880 

ggaggttatg atgatgatgg taaaccgaaa attccaggaa ccttaaataa gggaattggt 2940 

ggagaacatg gtatagataa atttaatcca gcaggagtba tacacieiatag aaaagataaa 3000 

ciatacaacat ccctggatca aaatccagaa ttatttgctt tcaataacga agggatcaac 3060 

gctccabcat caagtggttc taagattgct aacatttatc ctttagattc aaatggaaat 3120 

cctcaagatig ctcaacttga aagaggatta acaccttctc cacbtgtatb aagaagtgca 3180 

gaagaaggat tgatbtcaat agtsiaataca aataaagagg gagaaaatca aagagactta 3240 

aaagtcattt cgagagaaca ctbtattaga ggaattttaa attctaaaag caatgatgca 3300 

aagggaatca aatcatctaa actaaaagtt tggggtgact tgaagtggga tggactcatc 3360 

babaatccba gaggtagaga agacUEULtgca ccagaaagta aggataatca agatcctgct 3420 

actEiagataa gaggtceiatt bgaaccgatt gcggaaggtc aatatttcta taaatttaaa 3480 

tatagattaua. ctaaagatta cccatggcag gtttcctata ttcctgtsiaa aabtgataac 3540 

accgccccta agatbgbttc ggttgatttt tcaaatcctg aaaaaattaa gttgattaca 3600 

aaggatacbt atcataaggb aaaagatcag tataagaatg aaacgctatb tgcgagagab 3660 

caaaaagaac atcctgaaaa abbtgacgag attgcgaacg aagbttggta tgctggcgcc 3720 

gctcttgtta atgaagatgg agaggttgaa aaaaatcttg aagtaactta cgcaggtgag 3780 

ggtcaaggaa gaaiabagaaa acttgataaa gacggastata ccatttatga aattaaaggt 3840 

gcgggagabt taaggggaaa aatcattgaa gtcattgcat tagatggttc tagcaatttc 3900 

acaaagattc atagaattaa atttgctaat caggctgatg aasuaggggat gatttcctat 3960 

tatctagtag atcctgatca agattcatct aaatatcaaa agcttggcga gattgcagaa 4020 

tctaaattta aiaaabttagg aeiatggaaaa gagggtagtc taaaaaaaga tacaactggg 4080 

gtagaacatic atcatcaaga aaatgaagag tctattaaag aaaaatctag ttttactatt 4140 

gatagaaata ttbcaacaat tagagacttt gaaaataaag acttaeiagaa actcattaaa 4200 

aagaaattta gagaagtbga tgattttaca agtgaaactg gtaagagaat ggaggaatac 4260 

gattataaab acgabgabaa aggaaabata abagccbacg abgabgggac bgabcbagaa 4320 

babgaaacbg agaaacbbga cgaaabcaaa bcaeiaaabbb abggbgbbcb aagbccgbcb 4380 

aaagabggac acbbbgaaab bcbbggaaiag abaagbaabg bbbcbaaaaa bgccaaggba 4440 

babbabggga abaacbabaa abcbabagaa abcaaagcga ccaagbabga bbbccacbca 4500 

asiaacgabga cabbbgabcb abacgcbaab abbaabgaba bbgbggabgg abbagcbbbb 4560 

gcaggagaba bgagabbabb bgbbaaagab aabgabcaga aaaaagcbga aabbaaaabb 4620 

agaabgccbg aaaaaabbaa ggaMcbaaa bcagaababc ccbabgbabc aagbbabggg 4680 
aatgtcatag aattagggga aggagatctt tcaaaaaaca aaccagacaa tttaactaaa 4740 

atggaatctg gtaaaatcta ttctgattca gaaaaacaac aatatctgtt aaaggataat 4800 

atcattctaa gaaaaggcta tgcactaaaa gtgactacct ataatcctgg aaaaacggat 4860 

atgttagaag gaaatggagt ctatagcaag gaagatatag caaaeiataca aaaggccaat 4920 

cctaatctaa gagccctttc agaaacaaca atttatgctg atagtagaaa tgttgaagat 4980 

ggaagaagta cccaatctgt attaatgtcg gctttggacg gctttaacat tataaggtat 5040 

caagtgttta catttaaaat gaacgataaa ggggaagcta tcgataaaga cggaaatctt 5100 

gtgacagatt cttctaaact tgtattattt ggtaaggatg ataaagaata cactggagag 5160 

gataagttca atgtagaagc tataaaagaa gatggctcca tgttatttat tgataccaaa 5220 

ccagtaaacc tttcaatgga taagaactac tttaatccat ctaaatctaa taaaatttat 5280 

gtacgaaatc cagaatttta tttaagaggt aagatttctg ataagggtgg ttttaactgg 5340 

gaattgagag ttaatgaatc ggttgtagat aattatttaa tctacggaga tttacacatt 5400 

gataacacta gagattttaa tattaagctg aatgttaaag acggtgacat catggactgg 5460 

ggaatgaaag actataaagc aaacggattt ccagataagg taacagatat ggatggaaat 5520 

gtttatcttc aaactggcta tagcgatttg aatgcbaaag cagttggagt ccactatcag 5580 

tttttatatg ataatgttaa acccgaagta aacattgatc ctaagggaaa tactagtatc 5640 

gaatatgctg atggeiaaatc tgtagtcttt aacatcaatg ataaaagaaa taatggattc 5700 

gatggtgaga ttcaagaaca acatatttat ataaiabggaa aagaatatac atcatttaat 5760 

gatattaaac aaataataga csuigacacta aacattaaga ttgttgteiaa agattttgca 5820 

agaaatacaa ccgtaaaiaga attcatttta aataaagata cgggagaggt aagtgaatta 5880 

aaacctcata gggtaactgt gaccattcaa aatggaaaag aaatgagttc aacgatagtg 5940 

tcggaagaag attttatttt acctgtttat aagggtgaat tagaaaaagg ataccaattt 6000 

gatggttggg aaatttctgg tttcgaaggt aaaaaagacg ctggctatgt tattaatcta 6060 

tcaaaagata cctttataaa acctgtattc aagaaaatag aggageiaaaa ggaggaagaa 6120 

aataaaccta cttttgatgt atcgaaaaag aaagataacc cacaagtaaa ccatagtcaa 6180 

ttaaatgaaa gtcacagaiaa agaggattta Ceiaagagaag agcattcaca aaaatctgat 6240 

tcaactaagg atgttacagc tacagttctt gataaaiaaca atatcagtag taaatcaact 6300 

actaacaatc ctaataagbt gccaaaaact ggaacagcaa gcggagccca gacactatta 6360 

gctgccggaa taatgtttat agtaggaatt tttcttggat tgaagaaaaa aaatcaagat 6420 

SeqID 34 

atggggaaag gccattggaa tcggaaaaga gtttiatagca ttcgtaagtt tgctgtggga 60 

gcttgctcag taatgatbgg gacttgtgca gttttattag gaggaaatat agctggagaa 120 

tctgtagttb abgcggatga aacacttabt actcatactg ctgagaaacc taaagaggaa 180 

aaaatgabag tagauigaasia ggctgataaa gctttggaaia ctaaaaatat agttgaaagg 240 

acagaacaaa gtgsuicctag tbcaacbgag gctattgcat cbgagaagaa agaagatgaa 300 

gccgtaactc caaaagagga aaaagtgtct gctaaaccgg aagaaaaagc tccaaggaba 360 

gaatcacaag cttcaaatca agaaaaaccg ctcaaggaag abgctaaagc tgtaacaaat 420 

geiagaagbga atcaaatgat tgaagacagg aaagtggatt ttaatcaaaa ttggtacttt 480 

aaactcaatg caaattctaa ggaagccatt aaacctgatg cagacgbatc tacgtggcuia 540 

aaattagatt taccgtatga ctggagtatc tttaacgatt tcgatcatga atctcctgca 600 

caaaatgaag gtggacagct caacggtggg gaagcttggt atcgcaagac tttcaaacta 660 

gatgaaaaag acctcaagaa aaatgttcgc cttacttttg atggcgtcta catggattct 720 

caagtttabg tcaatggtca gttagtgggg cattatccaia atggttataa ccagttcbca 780 

tatgatatca ccaaatacct tcaaaaagat ggtcgtgaga atgtgattgc tgtccatgca 840 

gtcaacaaac agccaagbag ccgttggtat tcaggaagtg gtatctatcg tgatgtgact 900 

tbacaagtga cagataaggb gcatgttgag aiaaMtggga caactatttt aacaccstaaa 960 

cttgaagaac aacaacabgg caaggtbgaa actcatgtga ccagcaaaat cgtcaatacg 1020 

gacgacaaag accatgaact: tgtagccgaa tatcaaatcg ttgeiacgagg tggtcatgct 1080 

gtaacaggct tagttcgtac agcgagtcgt accttaaaag cacatgaatc aacaagccba 1140 

gatgcgattt tagaagttga aagaccaaaa ctctggactg ttttaaatga caaacctgcc 1200 

ttgtacgaat tgattacgcg tgtttaccgt gacggtcaat tggttgatgc taagaaggat 1260 

ttgtttggtt accgttacta tcactggact ccaaatgaag gtttctcttt gaatggtgaa 1320 

ogtattaaat tccatggagt atccttgcac cacgaccatg gggcgcttgg agcagaagaa 1380 

aactataaag cagaababcg ccgtctcaaa caaatgaagg agatgggagt taactccatc 1440 

cgtacaaccc acaaccctgc tagtgagcaa accttgcaaa tcgcagcaga actaggbtta 1500 

cbcgttcagg aagaggcctt tgatacgtgg tatggtggca agaaiacctta tgactatgga 1560 

cgbttcttbg aaaaagatgc cactcaccca gaagctcgaa aaggtgsiaaa atggtctgat 1620 

tttgacctac gbaccatggt. cgaaagaggc ataaaacaacc ctgctatctt catgtggtca 1680 

atbggtaatg aaataggtga agctaatggt gatgcccact ctttagcaac bgttaaacgt 1740 

ttggttaagg ttatcaagga tgttgataag actcgctatg ttaccatggg agcagataaa 1800 

tbccgtbtcg gtaatggrtag cggagggcat gagaaaattg ctgatgaact cgatgctgtt 1860 

ggatttaact attcbgaaga taattacaaa gcccttagag ctaagcatcc aaaatggttg 1920 

atttatggat cagaaacatc ttcagctacc cgtacacgtg gaagttacta. tcgccctgaa 1980 

cgtgaattga aacatagcaa tggacctgag cgtaattatg aacagtcaga ttatggaaat 2040 

gatcgtgtgg gttgggggaa aacagcciacc gcttcatgga cttttgaccg tgacaacgct 2100 

ggctatgctg gacagtttab cbggacaggt. acggactata ttggtgaacc tacaccatgg 2160 

cacaaccaaa atcaaactcc tgttaagagc tcttactttg gtatcgtaga tacagccggc 2220 
attccaaaiac atgacttcta tctctaccaa agccaatggg tttctgttaa gaagaaiaccg 2280 

atggtacacc ttcttcctca ctggaactgg gaaaacaaag aattagcatc caaagtagct 2340 

gactcagaag gtaagattcc agttcgtgct tattcgaatg cttctagtgt agaattgttc 2400 

ttgaatggaa aatctcttgg tcttaagact ttcaataau aacaaaccag cgatgggcgg 2460 

acttaccaag aaggtgcaaa tgctaatgaa ctttatcttg aatggaaagt tgcctatcaa 2520 

ccaggtacct tggaagcaat tgctcgtgat gaatctggca aggaaattgc tcgagataag 2580 

attacgactg ctggtaagcc agcggcagtt cgtcttatta aggaagacca tgcgattgca 2640 

gcagatggaa aagacttgac ttacatctac tatgaaattg ttgacagcca ggggaatgtg 2700 

gttccaactg ctaataatct ggttcgcttc caattgcatg gccaaggtca actggtcggt 2760 

gtagataacg gagaacaagc cagccgtgaa cgctataagg cgcaagcaga tggttcttgg 2820 

attcgtaaag catttaatigg taaaggtgtt gccattgtca aatcaactga acaagcaggg 2880 

aaattcaccc tgactgccca ctctgatctc ttgaaatcga accaagtcac tgtctttact 2940 

ggtaagaaag aaggacaaga gaagactgtt ttggggacag aiagtgccaaa agtacagacc 3000 

attattggag aggcacctga aatgcctacc actgttccgt ttgtatacag tgatggtagc 3060 

cgtgcagaac gtcctgtaac ctggtcttca gtagatgtga gcaagcctgg tattgtaacg 3120 

gtgaaaggta tggctgacgg acgagaagta gaagctcgtg tagaagtgat tgctcttaaa 3180 

tcagagctac cagttgtgaa acgtattgct ccaaatactg acttgaattc tgtagacaaa 3240 

tctgtttcct atgttttgat tgatggaagt gttgaagagt atgeiagtgga ceiagtgggag 3300 

attigccgaag aagataaagc t.aagt^agca attccaggtt ctcgtattca agcgaccggt 3360 

tatttagaag gtcaaccaat tcatgcaacc cttgtggtag aagaaggcaa tcctgcggca 3420 

cctgcagtac caactgtaac ggttggtggt gaggcoigtaa caggtcttac tagtcaaaaa 3480 

ccaatgcaat accgcactct tgcttatgga gctaagttgc cagaagtcac agcaagtgct 3540 

aaaaatgcag ctgttacagt tcttcaagca agcgcagcaa acggcatgcg tgcgagcatc 3600 

tttattcagc cteiaagatgg tggccctctt caaacctatg caattcaatt ccttgaagaia 3660 

gcgccaaaaa ttgctcactt gagcttigcaa gtggaaaaag ctgacagtct caaagaagac 3720 

caaactgtca aattgticggt tcgagctcac tatcaagatg gaacgcaagc tgtattacca 3780 

gctgataaag taaccttctc tacaagtggt gaaggggaag tcgcaattcg taaaggaatg 3840 

cttgagttgc ataagccagg agcagtcact ctgaacgctg aatatgaggg agctaaagac 3900 

caagttgaac tcactatcca agccaatact gagaagaaga ttgcgcaatc catccgtcct 3960 

gtaaatgtag tgaGagattt gcatcaggaa ccaagtcttc cagcaacagt aacagttgag 4020 

tatgacaaag gtttccctaa aactcataaa gtcacttggc aagctattcc gaaagaaaaia 4080 

ctagactcct atcaaacatt tgaagtacta ggtaaagttg eiaggaattga ccttgeiagcg 4140 

cgtgcaaaag tctctgtaga aggtatcgtt tcagttgaag aagtcagtgt gacaactcca 4200 

atcgcagaag caccacciatt accagaaagt gttcggacat atgattcaaa tggtcacgtt 4260 

tcatcagcta aggttgcatg ggatgcgatt cgtccagagc aatacgctaa ggaaggtgtc 4320 

tttacagtta atggtcgctt agaaggtacg caattctacaa ctaaacttca tgttcgcgta 4380 

tctgctcaaa ctgagcaagg tgcaaacatt tctgaccaat ggaccggttc agaattgcca 4440 

cttigcctl^tg ctticagactc aaatccaagc gacccagttt caiaatgttaa tgacaagcHc 4500 

atttcctaca ataaccaacc agccsLatcgt tggaceiaact ggaatcgtac taatccagaa 4560 

gcttcagtcg gtgttctgtt tggagattca ggtatcttga gcaaacgctc cgttgataat 4620 

ctaagtgtcg gattccatga agaccatgga gttggtgtac cgaagtctta tgtgattgag 4680 

tattatgttg gtaagactgt cccaacagct cctoiaaaacc ctagttttgt tggtaatgag 4740 

gaccatgtct ttaatgattc tgccaactgg aaaccagtta ctaatctaaa agcccctgct 4800 

caactcaagg ctggagaaat gaaccacttt agctttgata aagttgasiac ctatgctgtt 4860 

cgtattcgca tggttaaagc agataacaag cgtggaacgt ctatcacaga ggtacaaatc 4920 

tttgcgaaac aagttgcggc agccaagcaa ggacaeuiccia gaatccaagt tgacggcaaa 4980 

gacttagceia acttcaaccc tgatttgaca gactactacc ttgagtctgt agatggaaaa 5040 

gttccggcag tcacagcaag tgttagceiac aatggtctcg ctaccgtcgt tccaagcgtt 5100 

cgtgaaggtg agccagttcg tgtcatcgcg aaagctgaaa atggcgacat cttaggagsLa 5160 

taccgtctgc acttcactaa ggataagagc ttactttctc ataaaccagt tgctgcggtt 5220 

aaacaagctc gcttgctaca agtaggtcaa gcacttgaat tgccgactaa ggttccagtt 5280 

tacttcacag gtaaagacgg ctacgaaaca aaagacctga cagttgaatg ggaagaagtt 5340 

ccagcggaaa atctgacaaa agcaggtcaa tttactgttc gaggccgtgt ccttggfcagt 5400 

otaccttgttg ctgagatcac tgtacgagtg acagacaaac ttggtgagac tctttcagat 5460 

aaccctaact atgatgaaaa cagtaaccag gcctttgctt cagcaaccaa tgatattgac 5520 

aaaaactctc atgaccgcgt tgactatctc aatgacggag atcattcaga aaatcgtcgt 5580 

tggacaaact ggtcaccaac accatcttct aatccagaag tatcagcggg tgtgattttc 5640 

cgtgaaaatg gtaagattgt agaacggact gttacacaag gaaaagttca gttctttgca 5700 

gatagtggta cggatgcacc atctaaactc gttttagaac gctatgtcgg tccagagttt 5760 

gaagt.gccsia cctactattc sLaactaccaa gcctacgacg cagaccatcc attcaacaat 5820 

ccagaaaatt gggaagctgt tccttatcgt gcggataaag acattgcagc tggtgatgaa 5880 

atcaacgtaa catttaaagc tatcaaagcc aaiagctatga gatggcgtat ggagcgtaeia 5940 

sfcagataaga gcggtgttgc gatgattgag atgaccttcc ttgcaccaag tgaattgcct 6000 

saagaaagca ctcaatcaaa gattcttgta gatggaaaag aacttgctga tttcgctgaa 6060 

aatcgtcaag actatcaaat tacctataaa ggtcaacggc caaaiagtctc agttgaagaa 6120 

aacaatcaag tagcttcaac tgtggtagat agtggagaiag atagctttcc agtacttgtt 6180 

sgcctcgttt cagaaagtgg aaaacaagtc aaggaatacc gtatccactt gactaiaggaa 6240 

laaccagttt ctgagaagac agttgctgct gtacaagaag atcttccaaa aatcgaattt 6300 
gttgaaaaag atttggcata caagacagtt gagaaaaaag attociacact gtatctaggt 6360 

gaaactcgtg tagaacaaga aggaaaagtt ggaaaagaac gtatctttac agcgattaat 6420 

cctgatggaa gtaaggaaga aaaactccgt gaagtggtag aagttccgac agaccgcatc 6480 

gtcttggttg gaaccaaacc agtagctcaa gaagctaaaa aaccacaagt gtcagaaaaa 6540 

gcagatacaa aaccaattga ttcaagtgaa gctagtcaaa ctaatastagc ccagttacca 6600 

agtacaggta gtigcggcaag ccaagcagca gtagcagcag gtttaactct tctaggtttg 6660 

agtigcaggat tagtagttac taaaggtaaa aaagaagac 6699 

SeqID 35 

atggctcctt ctgtagtgga cgcagccacc tatcactatg taaataaaga gattatttca 60 

caagaagcta aagatttaat tcagacagga aagcctgaca ggaatgaagt tgtatatggt 120 

ttggtgtatc aaaaagatca gttgcctcaa acagggacag aagcatctgt tttgacagct 180 

tttggtttgc tgactgttgg gagcttgctt ttaatctaca agagaaagaa aattgctagc 240 

gtctttctag ttggagctat gggattggta gttcttccta gtgcaggggc tgtagaccca 300 

gttgcgaccc tagcgctggc tagtcgagag ggtgttgttg aaatggaggg ctatcgctat 360 

gttggttatc taticaggtga catcctcaaa acgcttggct tggacactgt tttagaagaia 420 

acctcagcaa aacctggaga ggtgactgtg gtcgaagttg agactcctca atcaataaca 480 

aatcaggagc aagctaggac agaaaaccaa gtagtagaga cagaggaagc tccaaaagaa 540 

gaagcaccta aaacagaaga aagtccaaag gaagaaccaa aatcggaggt aaaacctact 600 

gacgacaccc ttcctaaagt agaagagggg aaagaagatt cagcagaacc agctccagtt 660 

gaagaagtag gtggagaagt tgagtcaaaa ccagaggaaa aagtagcagt taagccagaa 720 

aglicaaccat cagacaaacc agctgaggaa tcaaaagttg aacstagcagg tgaaccagtc 780 

gcgccaagag aagacgaaaa ggcaccagtc gagccagaaa agcaaccaga agctcctgaa 840 

gaagagaagg ctgtagagga aacaccgaaa caagaagagt caactccaga taccaaggct 900 

gaagaaactg tagaaccacka agaggagact gttaatcaat ctattgaaca accaaaagtt 960 

gaaacgcctg ctgtagaaaa acaaacagaa ccaacagagg aaccaaaagt tgaacaagca 1020 

ggtgaaccag tcgcgccaag agaagacgaa caggcaccaa cggcaccagt tgagccagaa 1080 

aagcaaccag aagttcctga agaagagaag gctgtagagg aaacaccgaa accagaagat 1140 

aaaatcUiagg gtattggtac taaagaacca gttgataaaa gtgagttaaa taatcaaatt 1200 

gafcaaagcta gttcagtttc tcctactgat tattctacag caagttacaa tgctcttgga 1260 

cctgttttag asiactgcaaa aggtgtctat gcttcagagc ctgtaaaaca gcctgaggta 1320 

aatagcgaga caaataaact taaaacggct attgacgctc taaacgttga taaaactgaa 1380 

ttaaacaata cgattgcaga tgcaaaaaca aaggtaaaag aacattacag tgatagaagt 1440 

tggcaaaacc tccaaactga agttacaaag gctgaaaaag ttgcagctaa tacagatgct 1500 

aaacaciagtg aagttaacga agctgttgaa aaattaactg caactattga aaaattggtt 1560 

gaatbatctg aaaagccaat attaacatitg actagtaccg ataagaaaat attggaacgt 1620 

gaagctgttg ctaagtatac tctagaaaat caaaacaaaa caaaaatcaa atcaatcaca 1680 

gctgaattga aaaaaggaga agaagttatt aatactgtag tccttacaga tgacaaggta 1740 

acaacagaaa ctataagcgc tgcatttaag aacctagagt actacaaaga atacacccta 1800 

tctacaacta tgatttacga cagaggtaac ggtgaagaaa ctgaaactct agaaaatcaa 1860 

aatattcaat tagatcttaa aaiaagttgag cttaaaaata ttaaacgtac agatttaatc 1920 

aaatacgaaa atggaaaaga aactaatgaa tcactgataa caactattcc tgatgataag 1980 

agcaattatt atttaaaaat aacttcaaat aatcagaaaa ctacattact agctgttaaa 2040 

aatatageiag aaactacggt taiacggaaca cctgtatata aagttacagc aatcgcagac 2100 

aatttagtct ctagaactgc tgataataaa ttitgaagaag aatacgttca ctatattgaa 2160 

aaacctaaag tccacgaaga taatgtatat tatsiatttca aagaattagt ggaagctatt 2220 

caaaacgatc cbtcaaaaga atatcgtcHg ggacaatcaa tgagcgctag aaatgttgtt 2280 

cctaatggaa aatcatatat cactaeiagaa ttcacaggaa aacttttaag ttctgaagga 2340 

aaacaatttg ctattactga attggaacat ccattattta atgtgataac aaacgcaacg 2400 

ataaabaatg tgaattttga aaatgtagag atiagaacgtt ctggtcaaga taatattgca 2460 

tcattagcca atactatgaa aggttcttca gttattacaa atgtcaaaat tacaggcaca 2520 

ctttcaggtc gtaataatgt tgctggattt gtaaataata tgaatgatgg aactcgtatt 2580 

gaaaatgttg ctttctttgg caiaactacac tctacaagtg gaaatggctc tcatacaggg 2640 

ggaattgcag gtacoLaacta tagaggaatt gttagaaaag catatgttga tgctactatt 2700 

acaggaaaca aaacacgcgc cagcttgtta gttcctaaag tagattatgg attaactcta 2760 

gaccatctta ttggtacaaa agctctccta actgagtcgg ttgtaaaagg taaaatagat 2820 

gtttcaaatc cagtagaagt tggagcaata gcaagtaaga cttggcctgt aggtacggta 2880 

agtaattctg tcagctatgc taagattatc cgtggagagg agttattcgg ctctaacgac 2940 

gttgatgatt ctgattatgc tagtgctcat ataaaagatt tatatgcggt agagggatat 3000 

tcgtcaggta atagatcatt taggaaatct aaaacattta ctaaattaac taaagaacaa 3060 

gctgatgcta aagttactac tttcaatatt actgctgata aattagaaag tgatctatct 3120 

cctcttgcaa aacttaatga agaaaaagcc tattctagta ttcaagatta taacgctgaa 3180 

tataaccaag cctataaaaa tcttgaaaaa ttaataccat tctacaataa agattatatt 3240 

gtatatcaag gtaataaatt aaataaagaa caccatctaa atactaaaga agttctttct 3300 

gttaccgcga tgaacaacaa tgagtttatc acaaacctag atgeiagctaa taaaattatt 3360 

gttcactatg cggacggtac aaaagattac tttaacttgt cttctagcag tgaaggttta 3420 

agtaatgtaa aagaatatac tataactgac ttaggaatta aatatacacc taatatcgtt 3480 

caaaaagata acactactct tgttaatgat ataaaatcta ttttagaatc agtagagctt 3540 
cagtctcaaa cgatgtatca gcatctaaat cgattaggtg actatagagt t£Uitgcaatc 3600 

aaagatttat atttageiaga aagcttcaca gatgttaaag aaaacttaac aaacctaatc 3660 

acaaaattag ttcaaaacga agaacatcaa ctaaatgatt ctccagctgc tcgtcaaatg 3720 

attcgtgata aagtcgagaa aaacaaagca gctttattac tagglietaac ttacctaaat 3780 

cgttactatg gagttaaatt tggtgatgtt aatattaaag aattaatgct attcaaacca 3840 

gatttctatg gtgaaaaagt tagcgtatta gacagattaa ttgaaatcgg ttctaaagag 3900 

siacaacatta aaggttcacg tacattcgac gcattcggtc aagtattggc tciaatatact 3960 

aaatcaggta atttagatgc atttttaaat tataatagac aattgttcac aaatatagac 4020 

aatatgaacg attggtittat tgatgctaca gaagaccatg tctacatcgc agaacgcgct 4080 

tctgaggtcg aagaaattaa aaattctaaa catcgtgcat tcgatsutttt astaacgaagt 4140 

caccttagaa atactatact cccactactg aatattgata aagcacatct ttatttaatt 4200 

tcaaattata atgcaattgc ctttggtagt gcagagcgat taggtaaaaia atcattagaa 4260 

gatattaaag atatcgttaa caaagctgca gatggttata gaaactatta tgatttctgg 4320 

tatcgtctag cgtctgataa cgttaaacaa cgactactaa gagatgctgt tattcctatt 4380 

tgggaaggtt ataacgctcc tggtggatgg gttgaaaaat atggccgcta taataccgac 4440 

aaagtatata ctcctcttag agaattcttt ggtcctatgg ataagtatta taattataat 4500 

ggaacaggag cttatgctgc tatatatcct aactctgatg atattagaac tgatgtaaaa 4560 

tatgttcatt tagaaatggt tggtgaatac ggtatttcag tttacacaca tgaaacaaca 4620 

cacgtcaacg accgtgcgat ttacttaggt ggctttggac accgtgaagg tactgatgct 4680 

gaagcatatg ctcagggtat gctacaaact cctgttactg gtagtggatt tgatgagttt 4740 

ggttctttag gtattaatat ggtatttaaa cgcaaaaatg atgggaatca gtggtatatt 4800 

acagatccaa aaactctaaa aacacgagaa gatattaata gatatatgaa gggttataat 4860 

gacactttaa ctcttcttga tgaaattgag gctgaatctg tgatttctca acaaaataaa 4920 

gatttaaata gtgcatggtt Ceiaaaaaata gatagagsiat accgtgataa caataaatta 4980 

aatcaatggg ataaaattcg aaatctaagt caagaagaga aaaatgaatt asiatattcaa 5040 

tctgttaatg atttagttga tcaacoiatta atgactaatc gcaatccagg taatggtatc 5100 

tataaacccg aagcaattag ctataacgat caatcacctt atgtaggtgt tagaatgatg 5160 

accggtatct acggaggtaa tactagtaaa ggtgctcctg gagctgtttc attcaaacat 5220 

aatgctttta gattatgggg ttactacgga tacgaaaatg ggttcttagg ttatgcttca 5280 

aataciatata aacaacaatc taaaacagat ggtgagtctg ttctaagtga tgutatatt 5340 

atcaagaaaa tatctaacaa tacatttaat actattgaag aatttaaasui agcttacttc 5400 

aaagaagtta aagataeiagc aacgeiaagga ttaacaacat tcgaagtaaa tggttcttcc 5460 

gtttcatcat acgatgattt actgacattg ttteiaagaag ctgttaaaaa agatgccgaa 5520 

actcttaaac siagaagcaaa cggtatataeia acagtatcta tgaataatac agttasiatta 5580 

aaageuigctg tttataagaa acttcttcaa caaacaaata gctttaaaac ttcaatcttt 5640 

aaa 5643 

SeqID 36 

atgaataaac gtctattttc aaaaatgagt ctggtgacgt tgccaatttt agccttgttt 60 

tcacaatcag ttttggcgga agaaaacatc catttttcga gctgteuigga agcttgggcg 120 

aatggctatt cggatattca cgagggagaa cctggttatt ctgccaagtt agaccgtgat 180 

catgatggtg tggcttgcga attgaiaaaat gctcctaagg gtgcttttaa agcaaaacag 240 

tcaacggcta ttcaaatcaa cacaagttca gcaacaacaa gtggttgggt taagcaggac 300 

ggcgcttggt actactttga tggasiatgga aatctagtga aaaiatgcatg gcagggaagc 360 

tattacctga aagctgatgg taaaatggca cagagtgaat ggatttatga ctcttcttat 420 

caagcttggt attatttgaa ateagatggt tcttatgcaa aaaatgcatg gcaaggagct 480 

tattacctta siatcaaacgg taaaatggca caaggtgagt gggtttatga ttcttcttac 540 

caagcatggt attacttgaa ateagatggt tcatatgctc gcaatgcatg gcaaggaaac 600 

tactatttga aatcagatgg taaaatggct aaaggtgaat gggtttatga tgccacctat 660 

saagcttggt attatttgac ateagatggt tcttatgctt acagtacatg gcaaggaaat 720 

kactatctaa aatcggatgg tasiaatggct gtcaatgaat gggttgatgg tggacgttat 780 

zatgttggcg ctgacggagt ttggaaggaa gttcaagcaa gtacagcttc ttctagtaat 840 

jatagcaata gtgaatattc tgctgcttta ggaaaggcaa aaagttataa ttcgttattc 900 

3acatgtcaa aaaaacgtat gtatagacaa ttaacttctg attttgataa attttcaaat 960 

yatgcagctc aatatgccat tgatcattta gatgat 996 

SeqID 37 

itgaaagtaa tagatcaatt taaaaataag aaagtccttg ttttaggttt ggccaagtct 60 

rgtgaatctg cagctcgttt gttggacaag ctaggtgcca ttgtgacagt aaatgatggg 120 

laacctttcg aggacaatcc agctgcccaa agtttgctgg aagaagggat caaggtcatt 180 

icaggtggcc atcetttgga actcttggat gaagagtttg cccttatggt gaaaaatcca 240 

rgtatcccet acsuicaatcc catgattgaa aaggctttgg ccaagggaat tccagtcttg 300 

ictgaggtgg aattggctta tttgatttca gaagcaccga ttattggtat cacaggatcg 360 

.acggtaaga caaccacaac gactatgatt ggggsiagttt tgactgctgc tggccsuicat 420 

•gtcttttat cagggaatat cggctatcca gctagtcagg ttgctcaaat agcatcagat 480 

aggacacgc ttgttatgga actttcttct ttccaactca tgggtgttca agaattccat 540 

cagagattg cggttattac caacctcatg ceaactcata tcgactacca tgggtcattt 600 

eggaatatg tagcagceaa gtggaatatc cagaacaaga tgacagcagc tgatttcctt 660 
gtcttgaact ttaatcaaga cttggcaaaa gacttgactt ccaagacaga agccactgtt 720 

gtaccatttt caiacacttga aaaggttgat ggagcttatc tggaagatgg tcaactctac 780 

ttccgtggtg aagtagtcac ggcagcgaat gaaatcggtg ttccaggtag ccacaatgtg 840 

gaaaatgccc ttgcgactat tgctgtagcc aagcttcgtg atgtggacaa tcaaaccatc 900 

aaggaaactc tttcagcctt cggtggtgtc aaacaccgtc tccagtttgt ggatgacatc 960 

aagggtgtta aattctataa cgacagtaaa tcaiactaata tcttggctac tcaaaaagcc 1020 

ttgtcaggat ttgacaacag caaggtcgtc ttgattgcag gtggtttgga ccgtggcaat 1080 

gagtttgacg aattggtgcc agacattact ggactcaaga agatggtcat cctgggtcaa 1140 

tctgcagaac gtgtcaaacg ggcagcagac aaggctggtg tcgcttatgt ggaggcgaca 1200 

gatattgcag atgcgacccg caaggcctat gagcttgcga ctcaaggaga tgtggttctt 1260 

cttagtcctg ccaatgctag ctgggatatg tatgctaact ttgaagtacg tggcgacctc 1320 

tttatcgaca cagtagcgga gttaaaagaa 1350 

SeqID 38 

atgaagaaaa aatttgccct atcgtttgtg gcgcttgcaa gtgtagcact tcttgcagcc 60 

tfftggagaag tgaagtctgg agcagtcaac actgctggta actcagtaga ggaasiagaca 120 

attaaaatcg ggtttaactt tgaagaatca ggttctttag ctgcatacgg aacagctgaa 180 

Cciaaaaggtg cccaattggc tgttgatgaa atcaatgcog caggtggtab cgatggaaaa 240 

caaatcgaag tagtcgataa agataataag tctgaaacag ctgaggctgc ttcagttaca 300 

actaaccttg taacccaatc taaagtatca gcagtcgtag gacctgcgac atctggtgcg 360 

actgcagctg cggtagcgaui cgctacaaaa gcaggtgttc cattgatctc accaagtgcg 420 

actcaagatg gattgactaa aggtcaagat tacctcttta ttggaacttt ccaagatagc 480 

ttccaaggaa aaattatctc aaactatgtt tctgaaaaat tsuiatgctaa gaaiagttgtt 540 

Gtttacactg acaatgccag tgactatgct aaagggattg caaaatcttt ccgcgagtca 600 

tacaagggtg aaatcgttgc agatgaaact ttcgtagcag gtgacacaga cttccaagca 660 

gcccttacaa aaatgaaagg gaaagacttt gatgctatcg ttgttcctgg ttactataat 720 

gaggctggta aaattgtaaa ccaagcgcgt ggcatgggaa ttgacaaacc aatcgttggt 780 

ggtgatggat tcaacggtga ggagtttgta caacaagcaa ctgctgaaaa agcatcaaac 840 

abcbacttta tctcaggctt ctcaactact gtagaagttt cagctsiaagc tauiagccttc 900 

Gttgacgctt accgtgctaa gtacaatgaa gagccttcaa catttgcagc cttggcttat 960 

gattcagttc accttgtagc aaacgcagca aaaggtgcta asiaattcagg tgaaatcaag 1020 

aataaccttg ctaaaacaaa agattttgswi ggtgtaactg gtcaaacaag cttcgatgca 1080 

gaccacaaca cagtcaaaac tgcttacatg atgaccatga acaatggtaa agttgaagca 1140 

gcagaagttg taaaacca 1158 

SeqID 39 

atgagtattt tagaagttaa aaatctgagt cacggttttg gtgaccgtgc aatttttgaa 60 

gatgtgtcct tccgtctcct caagggagaa catatcggcc tggtcggtgc caatggtgaa 120 

ggaaaatcaa cctttatgag tatcgtgact ggtaaaatgc tgccagatga aggaaaggtt 180 

gagtggtcca aatatgtgac ggctggttac ttggatcagc actetgtcct tgctgaaaga 240 

cagtcggtgc gtgatgttct ccgtacggct tttgatgagc ttttcaaagc tgaagctcgt 300 

atcaatgacc tctatatgaa aatggctgaa gacggcgcgg atgttgatgc tctcatggaa 360 

gaagtaggag aacttcaaga ccgtctggag agtcgtgatt tctatacctt ggatgctaag 420 

attgacgaag tagcgcgtgc tcttggtgtt atggactttg gcatggatac ggatgtaact 480 

tctttgtcag gtgggcaaag aaccaaggtg cttttggcaa ctacttctcct tgaaaagcct 540 

gatatcttgc tgttggacga gccgaccaac tacttggatg ctgagcatat tgattggctc 600 

aagcgctatc tccaaaacta tgagaatgcc tttgttctca tttcgcacga tattccattc 660 

ctcaatgacg ttattaatat tgtctatcat gtggaaaatc aacagctgac gcgttactct 720 

ggtgactact accagttcca agaagtttat gctatgaaga aatctcagct agaggcagcc 780 

tacgaacgcc agcagaaaga gattgcagac ctcaaggact ttgtggctcg taataaagcc 840 

cgtgttgcaa ctcgtaatat ggctatgtct cgtcaaaaga aattggataa gatggatatt 900 

atcgaactcc aaagtgagaa accaaaacca tcctttgatt tcaaaccagc tcgtacacca 960 

gggcgcttta tcttccaagc caagaacttg caaattggtt acgaccgtcc tcttactaag 1020 

cctttaaatc ttaccttcga acgcaatcaa aaggttgcga ttattggtgc taatggtatt 1080 

grgaaaaacaa ctctcttgaa gagtctcttg ggcattatct cgccaatcgc tggggaagtg 1140 

jagcgtggag attatttaga aettggttat tttgagcagg aagtagaagg cggtaatcgc 1200 

raaactcctc ttgaagctgt ctggaatgcc tttcctgccc ttaatcaagc agaagtccgt 1260 

jcagcccttg cccgttgtgg tttgacaacc aaacatattg aaagccagat tcaagtatta 1320 

icagggggag agcaagccaa ggttcgtttc tgtctcttga tgaatcgtga asuusaacgtt 1380 

rtagtgctgg acgagccgac caaccatttg gatgtggatg caaaggatga gctcaaacgc 1440 

jctctcaaag aatatagggg atctatcctt atggtctgcc acgagccaga cttttatgaa 1500 

rgctggatag accaaatatg ggattttaat aatttaact 1539 

SeqID 40 

itgaagaaaa agaatggtaa agctaaaiaag tggcaactgt atgcagcaat cggtgctgcg 60 

igtgtagttg tattgggtgc tggggggatt ttactcttta gacMccttc tcagactgct 120 

ttaaaagatg agcctactca tcttgttgtt gccaaggaag gaagcgtggc ctcctctgtt 180 

itattgtcag ggacagtaac agcaaaaaat gaacaatatg tttattttga tgctagtaag 240 
ggtgatttag atgaaatcct tgtttctgtg ggcgataagg tcagcgaagg gcaggcttta 300 

gtcaagtaca gtagttcaga agcgcaggcg gcctatgatt cagctagtcg agcagtagct 360 

agggcagatc gtcatatcaa tgaactcaat caagcacgaa atgaagccgc ttcagctccg 420 

gctccacagt taccagcgcc agtaggagga gaagatgcaa cggtgcaaag cccaactcca 480 

gtggctggaa attctgttgc ttctattgac gctcaattgg gtgatgcccg tgatgcgcgt 540 

gcagatgctg cggcgcaatt aagcaaggct caaagtcaat tggatgcaac aactgttctc 600 

agtaccctag agggaactgt ggtcgaagtc aatagcaatg tttctaaatc tccaacaggg 660 

gcgagtcaag ttatggttca tattgticagc aatgaaaatt tacaagtcaa gggagaattg 720 

tctgagtaca atctagccaa cctttctgta ggtcaagaag taagctttac ttctaaagtg 780 

tatcctgata aaaaatggac tgggaaatta agctatattt ctgactatcc taaaaacaat 840 

ggtgeiagcag ctagtccagc agccgggaiat aatacaggtt ctaaataccc ttatactatt 900 

gatgtgacag gcgaggttgg tgatttgaaa caaggttttt ctgtcaacat tgaggttaaa 960 

agcaiaaacta aggctattct tgttcctgtt agcagtctag taatggatga tagtaaaaat 1020 

tatgtctgga ttgtggatga acaacaaaag gctaaaaaag ttgaggtttc attgggaaat 1080 

gctgacgcag aaaatcaaga aatcacttct ggtttaacga acggtgctaa ggtcatcagt 1140 

aatccaacat cttccttgga agaaggaaaa gaggtgaagg ctgatgaagc aactaat 1197 

SeqID 41 

tcagaaacaa atcacgaaat tgattcasiat tttgcaggtc gtttaaatat cctgcgtgcg 60 

ggtgttcttg atgctaacga tggaattatt tccattgctg gtgtggttat cggagttgcc 120 

agtgccacga ccaatatctg gattatcttt ttatcaggtt ttacggctat cttagctggt 180 

gccttttcaa tggctggtgg agaatatgta tccgtttcaa ctccaaaaga taccgaggaa 240 

gctgccgttt cgcgagciaaa actcttgcta gaccaagata gggaactagc caaaaaatcc 300 

ctctatgctg cttatatcca aaatggagaa ttcaaaactt ctgcccaact cttgaccaat 360 

aagatctttc ttaaaaatcc actcaaggct ctggtagagg aaaaatatgg gattgagtat 420 

gciagaattta ccaatccttg gcacgctgcc atttctagct tcgttgcctt tttccttaga 480 

agtttgcctc caatgctgtc agtgaccatt tttccaagtg attaccgcat ccctgctacc 540 

gtccttattg tcggtgtggc ccttcttetc actggttaca caagtgctag acttggaaaa 600 

gccccaacca aaacagctat gattcggaac cttgctattg gtctcttgac catgggagtt 660 

accttcctgc tcggacaact tttcagcatt 690 

SeqID 42 

atgaaaaaga aattaactag tttagcactt gtaggcgctt ttttaggttt gtcatggtat 60 

gggsiatgttc aggctcaaga aagttcagga aataaaatcc actttatcaa tgttcaagaa 120 

ggtggcagtg atgcgattat tcttgaaagc eiatggacatt ttgccatggt ggatacagga 180 

gaagattatg atttcccaga tggaagtgat tctcgctatc catggagaga aggaattgaa 240 

acgtcttata agcatgttct aacagaccgt gtctttcgtc gtttgaagga attgggtgtc 300 

caaaaacttg attttatttt ggtgacccat acccacagtg atcatattgg aaatgttgat 360 

gaattactgt ctacctatcc agttgaccga gtctatctta agaaatatag tgatagtcgt 420 

attactaatt ctgaacgtct atgggataat ctgtatggct atgataaggt tttacagact 480 

gctgcagaaa aaggtgtttc agttattcaa aatatcacac aaggggatgc tcattttcag 540 

tttggggaca tggatattca gctctataat tatgaaaatg aaactgattc atcgggtgaa 600 

ttaaagaaaa tttgggatga caattccaat tccttgatta gcgtggtgaa agtcaatggc 660 

aagaaaattt accttggggg cgatttagat aatgttcatg gagcagaaga caagtatggt 720 

cctctcattg gaaaagttga tttgatgaag tttaatcatc accatgatac caacaaatca 780 

aataccaagg atttcattaa aaatttgagt ccgagtttga ttgttcaaac ttcggatagt 840 

ctaccttgga aaeiatggtgt tgatagtgag tatgtteiatt ggctceuiaga acgaggaatt 900 

gagagaatca acgcagccag caaagactat gatgcaacag tttttgatat tcgaaaagac 960 

ggttttgtca atatttcaac atcctacaag ccgattccaa gttttcaagc tggttggcat 1020 

aagagtgcat atgggaactg gtggtatcaa gcgcctgatt ctacaggaga gtatgctgtc 1080 

ggttggaatg aaatcgaagg tgaatggtat tactttaacc aaacgggtat cttgttacag 1140 

aatcaatgga aaaaatggaa caatcattgg ttctatttga cagactctgg tgcttctgct 1200 

aaaaattgga agaeiaatcgc tggaatctgg tattatttta acaaagaaaa ccagatggaa 1260 

attcrgttgga ttcaagataa agagcagtgg tattatttgg atgttgatgg ttctatgaag 1320 

acaggatggc ttcaatatat ggggcaatgg tattactttg ctccatcagg ggaaatgaaa 1380 

atgggctggg taaeiagataa agaaacctgg tactatatgg attctactgg tgtcatgaag 1440 

acaggtgaga tagaagttgc tggtcaacat tattatctgg aagattcagg agctatgaag 1500 

caaggctggc ataaaaaggc aaatgattgg tatttctaca agacagacgg ttcacgagct 1560 

gtgggttgga tcaaggacaa ggatautatgg tacttcttga aagaaaatgg tcaattactt 1620 

gtgaacggta agacaccaga aggttatact gtggattcaa gtggtgcctg gttagtggat 1680 

gtttcgatcg agaaatctgc tacaattaaa actacaagtc attcagaaat aaaagaatcc 1740 

aaagaagtag tgaaaaagga tcttgaaaat aaagaaacga gtcaacatga aagtgttaca 1800 

aatttttcaa ctagtcaaga tttgacatcc tccLacttcac aaagctctga aacgagtgta 1860 

aacaaatcgg aiatcagaaca g 1881 

SeqID 43 

atggacttag gtcccaccca aagaggtatt agtgtcgtgt ctcaatctta tatcaatgtt 60 

atcggtgctg gtttggcagg ttctgaagca gcttaccEuia tcgcagagcg tggtattcca 120 
gttaaactat atgaaatgcg tggtgtcaag tctacacccc agcataaaac agacaatttt 180 

gctgagttgg tttgttccaa ttctttgcgt ggggatgctt tgacaaatigG agttggtctt 240 

ctcaaggaag aaatgcgtcg cttgggttct gttatcttgg aatctgctga ggctacacgt 300 

gttcctgcag gtggtgccct tgcagtggac cgtgatggtt tctctcaaat ggtgaccgaa 360 

aaagttgcca accacccctt gattgaagtg gttcgtgatg aiaattacaga attgccgaca 420 

gatgttatta cggttatcgc tactggtcct ttgacaagtg atgccttggc tgaaaagatt 480 

catgctctta atgacggtgc tggtttttat ttctacgatg cggcagcgcc tattatcgat 540 

gtcaacacta tcgatatgag caaggtctac ctcaaatcac gttatgataa gggagaagcg 600 

gcctacctca atgcccctat gaccaagcaa gaatttatgg atttccatga agctttggtc 660 

aatgcagaag aagcaccgct tagttctttt gaaaaagaaa agtactttga aggatgtatg 720 

cctatcgaag tcatggccaa acgtggcatt aaaactatgc tttatggccc tatgaagcca 780 

gtcggtcttg agtacccaga cgactataca ggacctcgtg atggagaatt taaaacacct 840 

tatgcggttg tgcaacttcg tcaggataat gcagctggta gcctctacaa tattgttggt 900 

ttccagaccc acctcaaatg gggagaacaa aagcgtgtct tccaaatgat tccgggtctt 960 

gaaaatgcgg agtttgtccg ttatggtgtg atgcatcgca attcttacat ggatt caeca 1020 

ciatcttcttg agcagactta ccgttctaag aaacaaccaa atctcttctt tgctggtcaa 1080 

atgacgggtg tggaaggcta tgttgagtcg gcggcttcag gcttagttgc gggaattaac 1140 

gcagctcgtc tcttcaagga agaaagcgag get at tt tec ccgagaegac agcgattgga 1200 

agcttagetc attacattac ccatgccgae agcaaacatt tccaaccaat gaatgtcaat 1260 

tttgggatea teaaggagtt ggaaggegag cgtatccgtg ataagaaggc tcgttatgaa 1320 

aaaattgcag agcgtgccct tgccgactta gaggaatttt tgactgtc 1368 

SeqID 44 

atgttaatcg gaatcccaaa agaaattaaa aataacgaaa accgtgtcgc ccteacacct 60 

gcaggtgttc atagcttagt tagtcgtggt catcgtgtcc ttatcgaaac aaatgctggt 120 

ctcggttctg getttactga tgctgactat caaaageaag gagctgagat tgtcgetact 180 

gctggtgaag cttgggeagc agagttggtt gtgaaagtaa aagaatcttt aagttetgaa 240 

tacggttact tgcgcgacga tcttcttctc ttcacctact tgcacatggc cgctgctcea 300 

gaattagcag atgctatgtt aacagcaaaa acaactgaaa ctgttcgtga caatcaagga 360 

caactaccgc tcctcgttcc tatgagtgag gttgcaggtc gtatggctgt tcaaatcgga 420 

gctcacttcc ttactaagca agctggtggc tctggtgttc tacttggtgg tgtaccaggt 480 

gttceaaaag gaaaagtaac tatcatcggt ggtggtgtcg tcggtaeaca tgctgccegc 540 

atcgcccttg gtcttggtgc tcaagtgact attttagata ttagttccaa gcgtctctca 600 

gttctagaag aagtctttgg aagtcaaatt caaaetctta tgtctaattc attcaaeatt 660 

gaagcaagtg tgagagatgc tgatgtggtg attggagcca ttctcatccc tggtgcaaaa 720 

gcaccggaat tggtgacaga tgagatggtc aaacaaatgc gtccaggctc tgtatcgttg 780 

acgttgctgt tgaccaaggt ggcgttatcg aaacagctga ccgtgtgaca acgcacgatg 840 

aaccegtcta tgaaaiaacac ggtgttctcc actatgccgt tgccaatatc cctggtgcgg 900 

ttgctcgcac ttcaaccatc gccctaacca atgtcactct tccttatatc gaagctttgg 960 

ctggcaaagg attcgeacaa gcaatctetg aagatgaagg cttgegtcaa ggtgtgacta 1020 

cttatcaagg ttacttgact aacctaccag ttgctcaagg acttaatcgt gactacactg 1080 

atatceiatga tttagta 1097 

SeqID 45 

atgaaaatta ataaaaaata tctagcaggt tcagtggcag tccttgccct aagtgtttgt 60 

tcctatgaac ttggtcgtca ccaagctggt caggttaaga aagagtetaa tegagtttct 120 

tatatagatg gtgatcaggc tggteaaaag gcagaaaact tgacaccaga tgaagtcagt 180 

aagagggagg ggatcaacgc cgaacaaatc gtcatcaaga ttacggatca aggttatgtg 240 

acctctcatg gagaccatta teattactat aatggcaagg tcccttatga tgccateatc 300 

agtgaagagc tectcatgaa agatccgaat tatcagttga aggattcaga cattgtceiat 360 

gaaatcaagg gtggttatgt tatcaaggta gatggaaaat actatgttta ccttaaggat 420 

gcagctcatg cggataatat tcggacaaaa gaagagatta aacgtcagaa gcaggaacac 480 

agtcataatc acgggggtgg ttctaacgat caagcagtag ttgcagccag agcccaagga 540 

cgctatacaa cggatgatgg ttatatcttc aatgcatctg atatcattga ggacacgggt 600 

gatgcttata tcgttectca cggcgaccat taccattaca ttcctaagaa tgagttatca 660 

gctagcgagt tagctgctgc agaagcctat tggaatggga agcagggatc tcgtccttct 720 

tcaagttcta gttataatgc aaatccagct caaccaagat tgtcagagaa ceacaatctg 780 

actgtcactc caacttatca tcaaaatcaa ggggaasiaca tttcaagcct tttacgtgaa 840 

ttgtatgcta aacccttatc agaacgccat gtggaatctg atggccttat tttcgaccca 900 

gcgcaaatca caagtcgaac cgccagaggt gtagctgtcc ctcatggtaa ccattaccac 960 

tttatccctt atgaacaaat gtctgaattg gaaaaacgaa ttgctcgtat tattcccctt 1020 

cgttatcgtt eaaaccattg ggtaceagat tcaagaccag aacaaccaag tccacaatcg 1080 

actccggaac ctagtccaag tccgcaacct gcaccaaatc ctcaaccagc tccaagcaat 1140 

ccaattgatg agaaattggt caaagaagct gttcgaaaag taggcgatgg ttatgtcttt 1200 

gaggagaatg gagtttctcg ttatateeca gccaaggatc tttcagcaga aacagcagca 1260 

ggcattgata gcaaactggc caagcaggaa agtttatctc ataagctagg agctaagaaa 1320 

actgacctcc catctagtga tcgagaattt tacaataagg cttatgactt actageaaga 1380 

attcaccaag atttacttga taataaaggt cgacaagttg attttgaggc tttggataac 1440 
ctgttggaac gactcaagga tgtcccaagt gataaagtca agttagtgga tgatattctt 1500 

gccttcttag ctccgattcg tcatccagaa cgtttaggaa aaccaaatgc gcaaattacc 1560 

tacactgatg atgagattca agtagccaag ttggcaggca agtacacaac agaagacggt 1620 

tatatctttg atcctcgtga tataaccagt gatgaggggg atgcctatgt aactccacat 1680 

atgacccata gccactggat taaaaaagat agtttgtctg aagctgagag agcggcagcc 1740 

caggcttatg ctaaagagaa aggtttgacc cctccttcga cagaccatca ggattcagga 1800 

aatactgagg caaaaggagc agaagctatc tacaaccgcg tgaaagcagc taageiaggtg 1860 

ccactitgatc gtatgcctta caatcttcaa tatactgtag aagtcaaaaa cggtagttfca 1920 

atcatiacctc attatgacca ttaccataac atcaaatttg agtiggtttga cgaaggcctt 1980 

tatgaggcac ctaaggggta tactcttgag gatcttttgg cgactgtcaa gtactatgtc 2040 

gaacatccaa acgaacgtcc gcattcagat aatggttttg gtaacgctag cgaccatgtt 2100 

cgtaaaaata aggtagacca agacagtaaa cctgatgaag ataaggaaca tgatgaagta 2160 

agtgagccaa ctcaccctga atctgatgaa aaagagaato acgctggttt aaatccttca 2220 

gcagataatc tttataaacc aiagcactgat acggaagaga cagaggaaga agctgaagat 2280 

accacagatg aggctgaaat tcctcaagta gagaattctg ttattaacgc taagatagca 2340 

gatgcggagg ccttgctaga aaaagtaaca gatcctagta ttagacaaaa tgctatggag 2400 

acattgactg gtctaaaaag tagtcttctt ctcggaacga aagataatsia cactatttca 2460 

gcagaagtag atagtctctt ggctttgtta asiagaaagtc aaccggctcc tatacag 2517 

SeqID 46 

atgaaattta gtaaaaaata tatagcagct ggatcagctg ttatcgtatc cttgagtcta 60 

tgtgcctatg cactaaacca gcatogttcg caggaaaata aggacaataa ticgtgtctct 120 

tatgtggatg gcagccagtc aagtcagaaa agtgaaaact tgacaccaga ccaggttagc 180 

cagaaagaag gaattcaggc tgagcaaatt gtaatcaaaa ttacagatca gggctatgta 240 

acgtcacacg gtgaccacta tcattactat aatgggaaag ttccttatga tgccctcttt 300 

agtgaagaac tcttgatgaa ggatccaaac tatcaactta aagacgctga tattgtcaat 360 

gaagtcaagg gtggttatat catcaaggtc gatggaaaat attatgtcta cctgaaagat 420 

gcagctcatg ctgataatgt tcgaactaaa gatgaaatca atcgtcaaaa acaagaacat 480 

gtcaaagata atgagaaggt taactctaat gttgctgtag caaggtctca gggacgatat 540 

acgacaaatg atggttatgt ctttaatcca gctgatatta tcgaagatac gggtaatgct 600 

tatatcgttc ctcatggagg tcactatcac tacattccca aaagcgattt atctgctagt 660 

gaattagcag cagctaaagc acatctggct ggaaaaaata tgcaaccgag tcagttaagc 720 

battcttcaa cagctagtga caataacacg caatctgtag ceiaaaggatc aactagcaag 780 

ccagcaouita aatctgaaaa tctccagagt cttttgaagg aactctatga ttcacctagc 840 

gcccaacgtt acagtgaatc agatggcctg gtctttgacc ctgctaagat tatcagtcgt 900 

acaccaaatg gagttgcgat tccgcatggc gaccattacc actttattcc ttacagcaag 960 

Gtttctgcct tagaagaaaa gattgccaga atggtgccta tcagtggaac tggttctaca 1020 

gtttctacaa atgcaaaacc toiatgaagta gtgtctagtc taggcagtct ttcaagcaat 1080 

ccttcttctt taacgacaag taaggagctc tcttcagcat ctgatggtta tatttttaat 1140 

ccaaaagata tcgttgaaga aacggctaca gcttatattg taagacatgg tgatcatttc 1200 

catitacattc caaaatcaaa tcaaiattggg caaccgactc ttccaaacaa tagtctagca 1260 

acaccttctc catctcttcc aatcaatcca ggaacttcac atgagaaaca tgaagaagat 1320 

ggatacggat ttgatgctaa tcgtattatc gctgaagatg aatcaggttt tgtcatgagt 1380 

cacggagacc acaatcatta bttcttcaag aaggacttga cagaiagagca aabtsiaggcb 1440 

gcgcaaaaac atttagagga agttaaaact agtcataatg gattagattc tttgtcatct 1500 

catgaacagg attatccaag taatgccaaa gastatgaeuig atttagataa aaaaatcgaa 1560 

gaaaaaattg ctggcattat gaaacaatat ggtgtcaaac gtgaaagtat tgtcgtgaat 1620 

aaagaaaaaa atgcgattat ttatcogcat ggagatcacc atcatgcaga tccgattgat 1680 

gaacataaac cggttggaat tggtcattct cacagtaact atgaactgtt taaacccgaa 1740 

gaaggagttg ctaaiaaaaga agggaataaa gtttatactg gagaagaatt aacgaatgtt 1800 

gttaatttgt taaaaaatag tacgtttaat aatcaaaact ttactctagc caatggtcaa 1860 

aaacgcgttt cttttagttt tccgcctgaa ttggagaaaa aattaggtat caatatgcta 1920 

gtaaaattaa taacaccaga tggaaaagta ttggagaaag tatctggtaa agtatttgga 1980 

gaaggagtag ggaatattgc aetactttgaa ttagatcaac cttatttacc aggacaaaca 2040 

tttaagtata ctatcgcttc aaaagattat ccagaagtaa gttatgatgg tacatttaca 2100 

gttccaacct ctttagctta caaaatggcc agtcaaacga ttttctatcc tttccatgca 2160 

ggggatactt atttaagagt gaaccctcaa tttgcagtgc ctaaaggaac tgatgcttta 2220 

gtcagagtgt ttgatgaatt tcatggaaat gcttatttag aaaataacta taaagttggt 2280 

gaaatcaaat taccgattcc gaaattaaac caaggaacaa ccagaacggc cggaaataaa 2340 

attcctgtaa ccttcatggc aaatgcttat ttggacaatc aatcgactta tattgtggaa 2400 

gtacctatct tggaaaaaga aaatcaaact gataaaccaa gtattctacc acaatttaaa 2460 

aggaataaag cacaiageiaaa ottaaaactt gatgaaaagg tagaagaacc aeiagactagt 2520 

gagaaggtag aaaaagaaaa actttctgsia actgggaata gtactagtaa ttcaacgtta 2580 

gaagaagttc ctacagtgga tcctgtacaa gaaaaagtag caaaatttgc tgasiagttat 2640 

gggatgaagc tagaaaatgt cttgtttaat atggacggaa caattgaatt atatttacca 2700 

tcgggagaag tcattaaaaa gaatatggca gattttacag gagaagcacc tcaaggaaat 2760 

ggtgaaaata aaccatctga aaatggaaaa gtatctactg gaacagttga gaaccaacca 2820 

acagaaaata aaccagcaga ttctttacca gaggcaccaa acgaaaaacc tgtaaaacca 2880 
gaaaactcaa cggataatgg aatgttgaat ccagaaggga atgtggggag tgaccctatg 2940 

ttagatccag cattagagga agctccagca gtagatcctg tacaagaaaa attagaaaaa 3000 

tttacagcta gttacggatt aggcttagat agtgttatat tcaatatgga tggaacgatt 3060 

gaattaagat tgccaagtgg agaagtgata aaaaagaatt tatqtgatct catagcg 3117 

SeqID 47 

atgaaaattt tatttgtagc agcagagggt gcaccctttt caaaaacagg tggtttggga 60 

gacgtcattg gcgctcttcc aaaatcactg gtaaaagctg ggcacgaagt tgcagtgatt 120 

ttaccctact atgatatggt agaggctaaa tttggaaatc agattgaaga tgtgcttcat 180 

tttgaggtga gcgttggttg gcgcagacag tattgtggaa ttaagaaaac agtattaaat 240 

ggtgtaacct tctactttat tgacaatcaa tattatttct tccgtggtca tgtttacggt 300 

gattttgatg acggagaacg ctttgccttt ttccaactgg ctgccattga ggctatggaa 360 

aggattgact ttattcctga tcttctccat gttcatgact accatacagc tatgattcct 420 

ttcttgttga aggaaaaata ccgttggatt caagcctatg aggacattga aacagtttta 480 

accattcata atttagaatt ccaaggacaa ttttcagaag gaatgttggg tgatttgttt 540 

ggagttggct ttgaacgtta cgctgatggc acccttcgat ggaacaactg tctgaactgg 600 

atgaaggcag gtattctcta tgcgaaccgt gtttcaaccg tttcacctag ctatgctcat 660 

gaaattatga ctagtcagtt tggatgtaat ttggatcaga ttcttaaaat ggagtctggt 720 

aaagtatctg gtatcgtgaa tgggattgat gctgatcttt ataatcctca gacggatgct 780 

cttttagact atcatttcaa tcaggaagat ttgtctggga aagccaaaaa taaggcaaaa 840 

ttgcaagaaa gagttggctt gcctgttaga gcagacgttc cactggtggg aattgtttct 900 

cgtttgacac gtcaaaaagg ttttgatgtg gtggtcgaaa gtcttcacca tatcttgcaa 960 

gaagatgttc agattgttct tttgggaact ggcgatccag cctttgaagg agctttctca 1020 

tggtttgctc agatttaccc agacaagcta tcaacaaata tcacttttga tgtcaaactt 1080 

gctcaggaaa tctacgctgc ttgtgacctc ttcctcatgc caagtcgttt tgaaccgtgt 1140 

ggcttgtctc aaatgatggc tatgcgttat ggaaccttgc cattggtcca tgaagttgga 1200 

ggcttgcgag atacagttcg cgctttcaat ccaatcgaag gaagcggtac tggctttagc 1260 

tttgacaatc tatctcctta ttggttaaat tggactttcc aaacagcatt ggacttgtat 1320 

agaaaccatc cagacatttg gagaaaccta caaaaacaag ctatggagag tgacttctca 1380 

tgggatacag cctgcaagtc ataccttgac ttgtaccata gtttagttaa t 1431 

SeqID 48 

atggaaaagt attttggtga aaaacaagag cgtttttcat ttagaaaatt atcagtagga 60 

cttgtatctg caacgatttc aagtttattt tttatgtctg tattagctag ttcatctgtg 120 

gatgctcaag aaactgcggg agttcactat aaatatgtgg cagattcaga gctatcatca 180 

gaagsiaaaga agcagcttgt ctatgatatt ccgacatacg tggagaatga tgatgaaact 240 

tattatcttg tttataagtt aaattctcaa aatcaactgg cggaattgcc aaatactgga 300 

agcaagaatg agaggcaagc cctagttgct ggtgctagct tagctgctat gggsiatttta 360 

atttttgctg tttccaagaa aaaggttaag aataaaacgg tattacattt agtattggtt 420 

gcagggatag gaaatggtgt cttagtttca gtccatgctt tagaaaatca tcttttgcta 480 

aattacaata cggactatga attgacctct ggagaaaaat tacctcttcc taaagagatt 540 

tcaggttaca cttatattgg atatatcaaa gagggaaaaa cgacttctga gtctgaagta 600 

agtaatcaaa agagttcagt tgccactcct acaaaacaac aaaaggtgga ttataatgtt 660 

acaccgaatt ttgtagacca tccatcaaca gtacaagcta ttcaggaaca aacacctgtt 720 

tcttcaacta agccgacaga agttcaagta gttgaaaaac ctttctctac tgaattaatc 780 

aatccaagaa aagaagagaa acaatcttca gattctcaag aacaattagc cgaacataag 840 

aatctagaaa cgaagaaaga ggagaagatt tctccaaaag aaaagactgg ggtaaataca 900 

ttaaatccac aggatgaagt tttatcaggt caattgaaca aacctgaact cttatatcgt 960 

gaggaaacta tggagacciaa aatagatttt caagaagaaa ttcaagaaaa tcctgattta 1020 

gctgaaggaa ctgtaagagt aaaacaagaa ggtaaattag gtaagaaagt tgaaatcgtc 1080 

agaatattct ctgtaaacaa ggaagaagtt tcgcgagaaa ttgtttcaac ttcaacgact 1140 

gcgcctagtc caagaatagt cgaaaaaggt actaaaaaaa ctcaagttat aaaggaacaa 1200 

cctgagactg gtgtagaaca taaggacgta cagtctggag ctattgttga acccgcaatt 1260 

cagcctgagt tgcccgaagc tgtagtaagt gacaaaggcg aaccagaagt tcaacctaca 1320 

ttacccgaag cagttgtgac cgacaaaggt gagactgagg ttcaaccaga gtcgccagat 1380 

actgtggtaa gtgataaagg tgaaccagag caggtagcac cgcttccaga atataagggt 1440 

aatattgagc aagtaaaacc tgciaactccg gttgagaaga ccaaagaaca aggtccagaa 1500 

aaaactgaag aagttccagt aaaaccaaca gaagaaacac cagtaaatcc aaatgaaggt 1560 

actacagaag gaacctcaat tcaagaagca gaaaatccag ttcaacctgc agaagaatca 1620 

acaacgaatt cagagaaagt atcaccagat acatctagca aaaatactgg ggaagtgtcc 1680 

agtaatccta gtgattcgac aacctcagtt ggagaatcaa ataaaccaga acataatgac 1740 

tctaaaaatg aaaattcaga aaaaactgta gaagaagttc cagtaaatcc aaatgaaggc 1800 

acagtagaag gtacctcaaa tcaagaaaca gaaaaaccag ttcaacctgc agaagaaaca 1860 

caaacaaact ctgggaaaat agctaacgaa aatactggag aagtatccaa taaacctagt 1920 

gattcaaaac caccagttga agaatceiaat caaccagaaa aaaacggaac tgcaacaaaa 1980 

ccagaaaatt caggtaatac aacatcagag aatggacaaia cagaaccaga accatcaaac 2040 

ggaaattcaa ctgaggatgt ttcaaccgaa tcaaacacat ccaattcaaa tggaaacgaa 2100 

geiaattaaac aagaaaatga actagaccct gataaaaagg tagaagaacc agagaaaaca 2160

cttgaattaa gaaatgtttc cgacctagag ttatacagtt tgtcaaatgg tacttataaa 2220

caacacattt cgttagagca agttccaiagc aatccaaata gctactttgt taaagtgaaa 2280 

tcttcttcat tcaaagatgt atacctacca gtagcatcaa tatcagagga aagaaaaaat 2340 

gataaaatcc tttataaaat cacagcaaaa gtagagaagc ttcagcagga gatagaaagc 2400 

agatataaag ataattttac cttctatcta gctaagaagg gaacagaaga aacaacaaac 2460 

tttacttcct ttagtaatct ggtcaaagct ataaaccaaa atccctctgg aacctatcat 2520 

ttagcggcca gcctgaatgc taacgaagtg gagcttggtc ctgatgaaag atcctatatc 2580 

aaggacacct ttactggtcg tttaatcggt gaaaaagatg gcaagaatta tgctatctat 2640 

aatttgaaaa aacctctgtt tgaaaacttg agtggtgcta cagtagaaaa actgagtcta 2700 

aaaeiatgttg ctatttcagg gaaagatgat atcggttcac tggcaaatga agctcagaat 2760 

aacacaaaaa ttaagcaagt tcacgtcgat ggtgttctgg ctggtgaacg tggtatcggt 2820

ggtttgctgg ctaaggctga gceiatcaagc atcacagaga gcagtttcaa gggaagaatt 2880 

atcaacactt atgaaacgac tgctgcctac aatatcggtg gtatggtcgg tcatttgaca 2940 

ggtgacaagg ctttacttac taagtcaaaa gcgacagtag ccatttcatc taacacaaat 3000 

acttcagatc agactgtggg tggacttgca ggcctagtag accgagatgc acagatccaa 3060 

gatagctatg ctgaaggtga tatcaaceiat gtcaagcact ttggtagagt cgctggagtg 3120 

gcaggcaatt tgtgggatcg aacttctggt gatgttaggc atgctggaag tttgaccaat 3180 

gttctcagcg atgttaatgt aaccaacgga aatgccatca ctggttacca ctataacgaa 3240 

atgaaggtaa aggacacatt cagcagcaag gccaacagag tctacaiatgt caccttggtc 3300 

aaggatgagg tcgtcagcaa ggaatccttt gaagaaagag gucaatgct agatgcttct 3360

caaattgcaa gcaaaaaagc agaaatcaat cetctcattt taccaacagt ggagccactt 3420 

tcaacaagtg gcaaaaaaga cagtgatttt tctaaggtgg cctattatca agctaagcgc 3480 

EUEicttgactt ataaaaacat tgaaaaattg ctacctttct acaacaaggc aaccatcgtc 3540

aaatacggaa acctggtcaa tgagaacagt cttttatatc aaaaagaact cttgtcagca 3600 

gtcatgatga aggacaacca agtcatcaca gacattgttt ctaacaaaca gactgcaaac 3660 

aaactcttgc ttcactacaa ggatgattta tctgagaagc tggatctcaa ataccagaat 3720 

gatttcgcca aattagcaga atatagtctg ggcaatactg gacttctcta tacgccaaac 3780 

caattcctgt atgaccaaac ctctatcatc aagcaagtct tacctgactt acaaaaggtt 3840 

gactatcabt cagaagccat cagaaagacg ctgggtattt ctccaaacgt caagcaaact 3900 

gagctctatc tagaagacca gttcgccaaa acaaaacaac aactggaaga cagtttgaaa 3960 

aaactcttgt cagcggatgc tggactggct agtgctaacc ccgtcactga aggttatctt 4020 

gtagataaaa tcaaacgcaa caaggaagcc ttgctacttg gcttgaccta tctggaacgg 4080 

tggtataact ttagctatgg tcaggtgaat gtcaaagacc tagttctgta ccatttggac 4140 

ttctttggta aggggaatgc ttcaccatta gatactctga ttgagttggg taaatctggc 4200 

tttaacaatc ttctagctaa gaataatgtc gatacttatg gtatcagtct tgccagtcaa 4260 

catggaacga cagatttgtt tagcacgctg gaacattacc gaaaagHctt tttaccaaat 4320 

acaagcaata atgactggtt taaatcagag actaaggctt acattgtcga agaaiaaatcc 4380 

actatcgaag aggtgaaaac gaagcaaggg ttagctggca ccaagtattc tatcggtgtt 4440 

tatgatcgta tcacgagtgc cacatggaaa taccgcaata tggtcttgcc tctcctgacc 4500 

ttgccagaga gatccgtatt tgtcatctcg accatgtcta gtctaggatt tggagcttat 4560 

gatcgctacc gcagtagtga ccataaagcg ggcaaggctc tcaatgattt tgttgaagaa 4620 

aatgcgcgtg aaacagccaa acgtcagcga gatcactacg attattggta tcgtatttta 4680 

gacgacaatg cacgtgaaaa actttataga aatattttgc tttacgatgc ttataaattt 4740 

ggcgatgata ataccgtagg gaaagctaca gaagtggcag attttgataa tccaaatcct 4800 

gcaatgcaac atttctttgg acctgttgga aataaagttg ggcataatca acacggtgct 4860 

tatgctacag gtgatgcagt ttattatatg ggttatcgaa tgttggatcui ggatggagct 4920 

attacttata cgcatgagat gacacatgac tcagatcagg acatttatct tggaggatat 4980 
ggtcgaagaa gtggcttggg accagagttc tttgctaaag gattattaca agcaccagac 5040 
catccagatg atgcgaccat taccatcaac tccatcttga aacactcaaa atctgatagt 5100
acagaaagtc gacgattaca agtacttgat ccaactacaa gatttaataa tgcagatgat 5160 
ttgaagceiat atgtccacaa catgtttgac gttgtttata tgttggaata tctcgaagga 5220 
aatHcaattc ttaaattgga tacgeiatcaa siaacaacaac ttcttagaaa agttacaaat 5280
gagtaccatc ctgatcctga tggaaataag gtctatgcaa caaatgttgt cagaauitcta 5340 
acagtagaag aagttgaaag actacgttca ttcaatgatt tgattgataa taatattctt 5400 
tcgtctaggg aatatgcctc aggtaaatac gaaagaaatg gctacttcac tattaagtta 5460 
tttgcaccga tttatgctgc attaagtaat gatataggaa caccaggtga cctgatggga 5520 
cgtcgtatag cctatgaact actagctgct aaaggcttta aagatggtat ggtaccatat 5580 
atctcaaacc aatacgaaga agaagcceiaa caaaagggca agacaiatcaa tctctacggt 5640
aaaacaagag gtttggttac agatgacttg gttttggaaa aggtatttaa taaccaatat 5700 
catacttgga gtgagtttaa gaaagctatg tatcaagaac gacaagatca gtttgataga 5760 
ttgaacaaag ttacttttaa tgatacaaca cagccttggc aaacatttgc caagaaaact 5820 
acaagcagtg tagatgaatt acagaaatta atggacgttg ctgttcgtaa ggatgcagaa 5880 
cacaattact accattggaa taactacaat ccagacatag atagtgaagt ccacaagctc 5940 
aagagagcaa tctttaaagc ctatcttgac caaacaaatg attttagaag ttcaattttt 6000 
gagaataaaa aa 6012 



SeqID 49

atgeiaaatta atetaasiaata tctagcaggt tcagtggcag tccttgccct aagtgtttgt 60

tcctatgagc ttggacgtta ccaagctggt caggataaga aagagtctaa tcgagttgct 120

tatatagatg gtgatcaggc tggtcaaaag gcagaaaact tgacaccaga tgaagtcagt 180 

aagagggagg ggatcaacgc cgaacaaatt gttatcaaga ttacggatca aggttatcftg 240 

acctctcatg gagaccatta tcattactat aatggcaagg ttccttatga tgccatcatc 300 

agtgaagagc tcctcatgaa agatccgaat tatcagttga aggattcaga cattgtcaat 360 

gaaatcaagg gtggttatgt cattaaggta aacggtaaat actatgttta ccttaaggat 420 

gcagctcatg cggataatat tcggacaaaa gaagagatta aacgtcagaa gcaggaacgc 480 

agtcataatc ataactcaag agcagataat gctgttgctg cagccagagc ccaaggacgt 540 

tatacaacgg atgatgggta tatcttcaat gcatctgata tcattgagga cacgggtgat 600 

gcttatatcg ttcctcacgg cgaccattac cattacattc ctaagaatga gttatcagct 660 

agcgagttag ctgctgcaga agcctattgg aatgggaagc agggatctcg tccttcttca 720 

agttctagtt ataatgcsiaa tccagctcaa ccaagattgt cagagaacca caatctgact 780 

gtcactccaa cttatcatca aaatcaaggg gaaaacattt caagcctttt acgtgaattg 840 

tatgctaaac ccttatcaga acgccatgtg gaatctgatg gccttatttt cgacccagcg 900 

caaatcacaa gtcgaaccgc cagaggtgta gctgtccctc atggtaacca ttaccacttt 960 

atcccttatg aacaaatgtc tgaattggaa aaacgaattg ctcgtattat tccccttcgt 1020 

tatcgttcaa accattgggt accagattca agaccagaag aaccaagtcc acaaccgact 1080 

ccagaaccta gtccaagtcc gcaaccagct ccaagcaatc caattgatga gaaattggtc 1140 

aaagaagctg ttcgaaaagt aggcgatggt tatgtctttg aggagaatgg agtttctcgt 1200 

tatatcccag ccaaggatct ttcagcagaa acagcagcag gcattgatag caaactggcc 1260 

aagcaggaaa gtttatctca taagctagga actaagaaaa ctgacctccc atctagtgat 1320 

cgagaatttt acaataaggc ttatgactta ctagcaagaa ttcaccaaga tttacttgat 1380 

aataaaggtc gacaagttga ttttgaggct ttggataacc tgttggaacg actcaaggat 1440 

gtctcaagtg ataaagtcaa gttagtggaa gatattcttg ccttcttagc tccgattcgt 1500 

catccagaac gtttaggaaa accaaatgcg caaattacct acactgatga tgagattcaa 1560 

gtagccaagt tggcaggcaa gtacacaaca gaagacggtt atatctttga tcctcgtgat 1620 

ataaccagtg atgaggggga tgcctatgta actccacata tgacccatag ccactggatt 1680

aaaaaagata gtttgtctga agctgagaga gcggcagccc aggcttatgc taaagagaaa 1740 

ggtttgaccc ctccttcgac agaccatcag gattcaggaa atactgaggc aaaaggagca 1800 

gaagctatct acaaccgcgt gaaagcagct aagaaggtgc cacttgatcg tatgccttac 1860

aatcttcaat atactgtaga agtcaaaaac ggtagtttaa tcatacctca ttatgaccat 1920 

taccataaca tcaaatttga gtggtttgac gaaggccttt atgaggcacc taaggggtat 1980 

actcttgagg atcttttggc gactgtcaag tactatgtcg aacatccaaa cgaacgtccg 2040 

cattcagata atggttttgg taacgctagc gaccatgttc aaagaaacaa aaatggtcaa 2100 

gctgatacca atcaaacgga aaaaccaagc gaggagaaac ctcagacaga aaaacctgag 2160 

gaagaaaccc ctcgagaaga gaaaccgcaa agcgagaaac cagagtcfccc aaaaccaaca 2220 

gaggaaccag aagaatcacc agaggaatca gaagaacctc aggtcgagac tgaaaaggtt 2280 

gaagaaaaac tgagagaggc tgaagattta cttggaaaaa tccaggatcc aattatcaag 2340 

tccaatgcca aagagactct cacaggatta aaaaataatt tactatttgg cacccaggac 2400 

aacaatacta ttatggcaga agctgaaaaa ctattggctt tattaaagga gagtaag 2457 

SeqID 50 

ttgattttaa gtgtttgttc ttacgagttg ggactgtatc aiagctagaac ggttaaggaa 60 

aataatcgtg tttcctatat agatggaaaa caiagcgacgc aaaaaacgga gaatttgact 120 

cctgatgagg ttagcaagcg tgaaggaatc aatgctgagc aaatcgtcat caagataaca 180 

gaccaaggct atgtcacttc acatggcgac cactatcatt attacaatgg taaggttcct 240 

tatgacgcta tcatcagtga agaattactc atgaaagatc caaactataa gctaaaagat 300 

gaggatattg ttaatgaggt caagggtgga tatgttatca aggtagatgg aaaatactat 360 

gtttacctta aggatgctgc ccacgcggat aacgtccgta caaaagagga aatcaatcga 420 

caaaaacaag agcatagtca acatcgtgaa ggtggaactc caagaaacga tggtgctgtt 480 

gccttggcac gttcgcaagg acgctatact acagatgatg gttatatctt taatgcttct 540

gatatcatag aggatactgg tgatgcttat atcgttcctc atggagatca ttaccattac 600 

attcctaaga atgagttatc agctagcgag ttggctgctg cageuLgcctt cctatctggt 660 

cgaggaaatc tgtcaaattc aagaacctat cgccgacaaa atagcgataa cacttcaaga 720 

acaaactggg taccttctgt aagcaatcca ggaactacaa atactaacac aagcaacaac 780 

agcaacacta acagtcaagc aagtcaaagt aatgacattg atagtctctt gaaacagctc 840 

tacaaactgc ctttgagtca acgacatgta gaatctgatg gccttgtctt tgatccagca 900 

caaatcacaa gtcgaacagc tagaggtgtt gcagtgccac acggagatca ttaccacttc 960 

atcccttact ctcaaatgtc tgaattggaa gaacgaatcg ctcgtattat tccccttcgt 1020 

tatcgttcaa accattgggt accagattca aggccagaac aaccaagtcc acaaccgact 1080 

ccggaaccta gtccaggccc gcaacctgca ccaaatctta aaatagactc aaattcttct 1140 

ttggttagtc agctggtacg aaaagttggg gaaggatatg tattcgaaga aaagggcatc 1200 

tctcgttatg tctttgcgaa agatttacca tctgaaactg ttaaaaatct tgaaagcaag 1260 

ttatcaaaac aagagagtgt ttcacacact ttaactgcta aaaaagaaaa tgttgctcct 1320 

cgtgaccaag aattttatga taaagcatat aatctgttaa ctgaggctca taaagccttg 1380 

tttgaeiaata agggtcgtaa ttctgatttc caagccttag acaaattatt agaacgcttg 1440 

aatgatgaat cgactaataa agaaaaattg gtagatgatt tattggcatt cctagcacca 1500 

attacccatc cagagcgact tggcaaacca aattctcaaa ttgagtatac tgaagacgaa 1560

gttcgtattg ctcaattagc tgataagtat acaacgtcag atggttacat ttttgatgaa 1620

catgatataa tcagtgatga aggagatgca tatgtaacgc ctcatatggg ccatagtcac 1680 

tggattggaa aagatagcct ttctgataag gaaaaagttg cagctcaagc ctatactaaa 1740 

gaaaaaggta tcctacctcc atctccagac gcagatgtta aagcaaatcc aactggagat 1800 

agtgcagcag ctatttacaa tcgtgtgaaa ggggaaaaac gaattccact cgttcgactt 1860 

ccatatatgg ttgagcatac agttgaggtt aaaaacggta atttgattat tcctcataag 1920 

gatcattacc ataatattaa atttgcttgg tttgatgatc acacatacaa agctccaaat 1980 

ggctatacct tggaagattt gtttgcgacg attaagtact acgtagaaca ccctgacgaa 2040 

cgtccacatt ctaatgatgg atggggcaat gccagtgagc atgtgttagg caagaaagac 2100 

cacagtgaag atccaaataa gaacttcaaa gcggatgaag agccagtaga ggaaacacct 2160 

gctgagccag aagtccctca agtagagact gaasiaagtag aagcccaact caaagaagca 2220 

gaagttttgc ttgcgaaagt aacggattct agtctgaaag ccaatgcaac agaaactcta 2280 

gctggtttac gaeiataattt gactcttcaa attatggata acaatagtat catggcagaa 2340 

gcagaaaaat tacttgcgtt gttaaaagga agtaatcctt catctgtaag taaggaaaaa 2400 

ataaac 2406 

SeqID 51 

atgccagtag aaattaaaac cactaaagaa attcatccta aaatctatgc ctacaccaca 60 

ccgacagtiaa ccagtaatga aggctggatt aagattgggt atacagaacg tgatgtcaca 120 

caacgtatca aggagcaaac gcatacagct catatagcta cagatgtctt atggactggt 180

gatgcagctt atacagaaga gcctgataag gggaaaactt tcaaggacca tgatttccac 240 

catttccttt ctttccatga tgtagaacgt cgtcccaaga cggaatggtt ctattttaat 300 

ggaactcctg aaaaatcaaa aaatcttttt gataagtttg ttcagcatga tttgtctggt 360 

tatcagcctg gaaaaggaca ggactatact ctgcgacaag agcaagaaga agcagttgct 420 

aagacattag cttatttcca agaacatgct ggaggcaagt ttctctggaa tgccaagcca 480 

cgctttggta aaaccttgtc tacctatgac ctagctcgac ggatggaagc tgtcaatgtc 540 

gtaattgtaa caaaccgccc tgccattgct aactcatggt atgatgattt tgaaacattc 600

atagcaggtc aaacgactta caagtttgtt tctgaatcag atagccttaa gagtcgtcca 660 

atcttgtcac gacaagaatt tcttggtatt ttagctgacg atgtaagaca acttgctttt 720 

atcagbctcc aagacttgaa aggatctgtt tatttaggtg gagagcacga taaactcaaa 780

tgggtaactg atctgcattg ggacttgttg gttattgacg aggctcatga aggagttgat 840 

accttcaaga ctgaccaagc ctttaataag attcgacgaa attttactct gcatttgtca 900 

ggtacatcat ttaaagcatt ggctaaagga gattttacag aggaacaaat ctacaactgg 960 

tcttatgctg atgagcaggc tgctaagtat tcgtggtctc ttgagcaaga agaggaaaat 1020 

ccttatgaaa gcttgcctca gttgaatctc tttacctatc aaatgtctca gatgattggc 1080 

gaaaagttag aaaaaggcgc tcagatcgat ggtgaaaata ttgactatgt ttttgactta 1140 

agtgaatttt tcgctacaga tgataaaggg aaatttattc atgagcatga tgteagaaat 1200 

tggttagata ctctatcaag csiatgaaaaa tatccatttt caaccaaaga actccgtaat 1260 

gaactcaagc atactttttg gcttttagaa cgtgtcgctt cggccaaagc attaaaagcc 1320

ctactagaag aacacccsiat ctatgaaaac tatgagatcg ttctagctgc tggtgacgga 1380 

cgtatgtccg aagaagacga tauaagtcaaa ctcaaatcct tggacttggt tagaaaagcg 1440 

atagcagaga atgacaaaac cattacccta tccgttggtc agctgacgac aggtgtcact 1500 

atccctgaat ggacaggtgt attgatgtta tcaaatttga aatcaccagc tctttatatg 1560 

eaggccgcct tccgtgctca aaatccttac tcatggagcg ataacasiagg aaatcacttt 1620 

cgcaaagaaa gagcctatgt atttgacttt gcgccggaaa gaaccttgat tctctttgat 1680

gagtttgcca acaacttatt gcttgtaact gcagctggta gaggaacttc agctacacgc 1740 

gaagaaaata ttagagaatt attaaacttc tttccaatta ttgccgaaga ccgtgctggt 1800 

aagatggttg aaattgatgc aaaggcagtt ctciaccactc ctcgccagat aaaagctaga 1860 

gaagttctta aacgaggttt tatgtccaat ctcttatttg abaatattag tggtattttc 1920 

caagcaagtc aaacagtttt agatatttta aatgagctgc cagttgaaaa ggaagggaag 1980 

gtacaagata gttctgattt attagatttt tcagatgtta cagtcgatga tgagggaaat 2040 

gcagtagtag accatgaaat tgtagttaat cagcaaatgc gactttttgg tgaaaaagtb 2100 

tatggacttg gtgaatctgt tgctgagtta gtcacaaaag atgaggaacg aiactcaaaaa 2160 

cagctggtca atgacttgag taagaccgtt tcttcagtga ttgtagagga attgaaagca 2220 

gattattctc taaaaacaag ggaaactgag caaattaaga aacaaattac agcaacactt 2280 

gagaatgaaa ttcgaiaaaaa tgatatcgaa agaaaaattt ctgaagctca tatcaagcaa 2340 

gagttgcaac agcagctcaa agaagcaaat gataaagcgc aaaaagataa gattcaagaa 2400 

gatttggaaa aacgtttaga agaaaataaa ctcattcata aagaaaaact agaacaaaca 2460 

ctcaaaaaag aagtggaaaa aatgcctgag aaatttatcg aacaggttga gataaaacgt 2520 

gtggaacagt bgaaacaatc agctcaagat gaaattcgtg accatttacg agggtttgca 2580 

agaacaattc caagttttat tatggcttac ggtgatcaaa ctctaacact tgataatttt 2640 

gatgcctttg tbcctgaaca tgttttttat gaagtaacag ggattacgat tgatcagttt 2700 

agatattbgc gagatggtgg gcaggatttt gcagggcatc tctttgataa agcaacattt 2760 

gacgeiagcta tucaagaatt tcttcgcaag asiaaaggagt tggcggatta ttttaaagat 2820 

caaaaagsLag acatttttga ctatattcca ccgcagaaga ccaaccaaat tttcactccb 2880 

aaacgagtgg tgaaaaggat ggtagatgat ttggaaaagg aaaatccagg gatttttgat 2940

gatccatcta agacttttat tgatttatat atgaagtcag gcctctatat tgcagaactt 3000 

gtgaagcggt tatataatag caatggcttg aaagaggcct ttccaGiatcc tgaagaacgc 3060

ttaaaacata ttttggaaaa gcaagtttat ggatttgctc cgtctgagat tatctataac 3120

atttccacta attttatatt tggcaatctt tctaaagata tcagtaggaa gaattttgtt 3180 

ttagcagata ccattccagc ggctaaagaa gggagcattc aaaagttggt tgattcctat 3240

tttgaaaata at 3252

SeqID 52

atgaaaaaaa tactsiattgt agatgatgag aaaccaatct cggatattat caagtttaat 60 

atgaccaagg aaggttacga agttgtaact gcttttaatg gtcgtgaagc gctagagcaa 120 

tttgaagcag agcaaccaga tattattatt ctggatttga tgcttccaga aattgatggt 180 

ttagaagttg ctaagaccat tcgtaagaca agcagtgtgc ccattcttat gctttcagcc 240 

aaagatagtg aatttgataa ggttatcggt ttggaacttg gggcagatga ctatgtaaca 300 

aaacccttct ccaatcgtga gttgcaggcg cgtgttaaag ctcttctgcg tcgttctcaa 360 

ccbatgccag tagatggtca ggaagcagat agtaaacctc aacctatcca aattggggat 420 

ttagaaattg ttccagacgc ctacgtggct aaaaaatatg gcgaagaact agacttaacc 480 

catcgtgaat ttgagctttt gtatcattta gcatcgcata caggtcaagt catcacgcgc 540 

gaacacttgc ttgagactgt ctggggttat gactattttg gtgatgtccg tacagttgat 600 

gtgactgtac gacgtctgcg tgagaagatt gaagatacgc ccagccgacc agagtatatc 660 

ttgacgcgcc gtggtgtagg gtattacatg agaaataatg ct 702 

SeqID 53 

atgaagaaaa aatttctagc atttttgcta attttattcc caattttctc attaggtatt 60 

gccaaagcag euiacgattaa gattgtttct gataccgcct atgcaccttt tgagtttaaa 120 

gattcagaec aaacttataa aggaattgat gttgacatta ttaacaaagt cgctgagatt 180 

aaaggctgga acattcagat gtcctatcct ggatttgacg cagcagtcaa tgcggttcaa 240 

gctgggcaag ccgacgctat catggcaggg atgacaaaga ctaaagaacg tgaaaaagtc 300 

ttcaccatgt ctgatactta ctatgataca aaagttgtca ttgctactac aaagtcacac 360 

aaaattagca agtacgacca attaactggc aaaaccgttg gtgttaaaaa cggaactgcc 420 

gctcaacgtt tccttgaaac aatcaaagat aaatacggct ttactattaa aacatttgac 480 

actggtgatt taatgaacaa cagcttgagt gctggtgcca tcgatgccat gatggatgac 540

aaacctgtta tcgaatatgc cattaaccaa ggtcaagacc tccatattga aatggatggt 600 

gaagctgtag gaagttttgc tttcggtgtg aaaaaaggaa gtaaatacga gcacctggtt 660 

actgaattta accaagcctt gtctgaaatg aaaaaagatg gtagtcttga tstsiaattatc 720

aagaaatgga ctgcttcatc atcttcagca gtgccaacta caactactct cgcaggatta 780 

aaagctattc ctgttaaggc taaatatatc attgccagcg attcttcttt tgcccctttt 840 

gttttccaaa attcaagcaa ccaatacact ggtattgata tggaattgat taaggcaatc 900

gctaaagacc aaggttttga aattgaaatc accaaccctg gttttgatgc tgctatcagt 960 

gctgtccaag ctggtcaagc cgatggtatc atcgctggta tgtctgtcac agatgctcgt 1020 

aaggcaactt ttgacttctc agaatcatac tacactgcta ataccattct tggtgtcaaa 1080 

gaatcaagca at:atLl:gct.t.c ttatgaagat ctaaaaggaa agacagtcgg tgttasiaaac 1140 

ggaactgctt ctcaaacctt cctaacagaa aatcaiaagca aatacggcta caaaatcaaa 1200 

acctttgctg atggttcttc aatgtatgac agtttaaaca ctggtgccat tgatgccgtt 1260 

atggatgatg aacctgttct caaatattct atcagccaag gtcaaaaatt gaaaactcca 1320 

atctctggaa ctccaatcgg tgaaacagcc tttgccgtta aaaaaggagc aaatccagaa 1380 

ctgattgaaa tgttcaacaa cggacttgca aaccttaaag ceiaacggtga atitccaaaag 1440 

attcttgaca aatacctagc tagcgaatct tcaactgctt caacaagtac tgttgacgaa 1500 

acaacgctct ggggcttgct tcaaaacaac tacaaacaac tccttagcgg tcttggtatc 1560 

actcttgctc tagctcttat ctcatttgct attgccattg tcatcggaat tatcttcggt 1620 

atgtttagcg ttagcccata caaatctctt cgcgtcatct ctigagabttt cgttgacgtt 1680 

attcgtggta ttccattgat gattcttgca gccttcatct tctggggaat tecaaacttc 1740 

atcgagtcta tcacaggcca acaaagccca attaacgact ttgtagctgg aiaccattgcc 1800

ctctcactca atgcggctgc ttatatcgct gaaatcgttc gtggtggtat tcaggccgtt 1860

ccagtbggcc asiatggaagc cagccgaagc ttgggtatct cttabggaaa aaccatgcgt 1920 

aagattatct tgccacaagc aactaaattg atgttgccaa actttgtcaa ccaattcgtt 1980 

atcgctctta aagatacsiac tatcgtatct gctatcggtt tggttgaact cttccaaact 2040

ggbaagatita tcattgctcg taactaccaa agtttcaaga tgtatgcaat ccttgctatc 2100 

ttctatcttg taattatcac acttttgact agactagcga aacgcttaga aaagaggatt 2160 

cgt 2163 

SeqID 54

atggcatttg aaagtttaac agaacgttcg cagaacgtct ttaaaaatck acgtaaaaaa 60 

ggaaaaatct ctgaatctga tgtccsuigag gcaaccaaag aaattcgctt ggccttgctc 120 

gaggccgacg ttgccttgcc tgttgtaaag gactttatca agaaagttcg tgagcgtgca 180 

gtcgggcatg aggtcattga tacacttaat cctgcgcaac agattat^taa aatcgttgat 240 

gaggaattga cagccgtttt aggttctgat acggcagaaa ttatcaagtc acctaagatt 300 

ccaaccatica tcatgatggt tggtttacaa ggggctggta aaacaacctt tgctggtaaa 360 

ttggccaaca aactcaagaa agaagaaaat gctcgtcctt tgatgattgc ggcggatatt 420 

tatcgtccag ctgccattga ccagctcaag accttgggac aacagactga tgtgcctgtc 480

bttgcactitg gaacagaagt accagcbgtb gagattgtac gtcaaggbtt ggagceiagcc 540

caaactaatc ataacgacta tgtcttgatt gatactgcgg gtcgtttgca gattgatgag 600

ctcctcatga atgagcttcg tgatgtgaaa gcattggctc aaccaaatga aatcttgctt 660

gtcgttgatg ctatgattgg tcaggaagca gccaatgttg cgcgtgagtt taatgctcag 720

ttggaagtga ctggggtcat ccttaccaiag attgatggcg atactcgtgg tggtgctgct 780

ctgtctgttc gtcacattac tggaaaacca atCGiagttca ctggtacagg tgaaaagatt 840

acggacattg aaaccttcca cccagaccgc atgtctagcc gtatccttgg tatgggggat 900

atgctcactt tgattgagaa agcttctcag gaatacgatg aacaaaaagc ccttgaaatg 960

gctgagaaga tgcgcgaaaa cacctttgat tttaatgatt tcatcgatca attagatcag 1020

gtgcaaaata tgragccgat ggaagacttg ctcaagatga ttccaggtat ggccaacaat 1080

ccagcccttc aaaacatgaa ggtggatgaa cgccagattg ctcgtaaacg tgccattgtg 1140

tcttcgatga cacctgaaga gcgtgaaaac ccagatttgt taaatccciag ccgtcgccgt 1200

cgtattgctg ctggttctgg aaatacattc gtcgaagtca ataaattcat caaggacttt 1260

aaccaggcta aacagctcat gcagggtgtt atgtctgggg atatgctat-aa aatgatgaag 1320

caaatgggga ttaatcceiaa taaccttcct aaaaatatgc caaatatggg aggaatggat 1380

atgtctgccc ttgaaggaat gatgggacaa ggcggtatgc gtgacttatc agctctcgga 1440

ggagcaggaa tgccagatat gagccagatg tttggtggcg gtttgaaagg taaaattggt 1500

gaatttgcca tgaiaacagc catgaaacgt atggctaaca aaatgaagaa agcgaagaag 1560

aaacgcaag 1569

SeqID 55

atgagccaaa tttggactaa agaaaaattt ataagccaag ttcaaggtgg agtcattgtt 60

tcttgtcaag ctttacctgg tgaagccctt tataatgaag aatttagctt gatgcctttt 120

atggctciaag cagctttaga ggcaggagca gtgggcattc gcgcaaiattic tgtgcgtgat 180

attaaagcaa ttcagaaagt agtagattta ccaataattg gaattatcaa aagggattat 240

ccacctcaag aaccatatat tactgctacg atgaaagaag tagatgaact tgtagaatgc 300

ggaacaacag tcattgcatt tgatgcaact ttaagaccaa gatatgatgg cttagttgtc 360

agtgaattta tcaaaaaaat aaaagaaaaa tatccgaatc aattgctgat ggcggatgta 420

agtaatttag atgaaggtct ctatgcattt aaatcaggcg ttgattttgt tggtacaaca 480

ttatcaggtt acacaagtac ciagtgtacaa tcagatgagc ctgattttga actaatgaaa 540

aaattggctg attttaatat tccggtaatt gccgaaggaa aiaatticatta tccageuicaa 600

ttaaaaaaag cttatagttt aggtgttacc agtgtagtca ttggtggagc gattacacgt 660

ccaaaagaaa ttgctcagcg atttattaat gtcatcaaa 699



SeqID 56 

atgagatatit taactgcagg agaatcacac ggcccccgtc taacagctat tattgaggga 60

attccagctg gacttccatt gacagctgag gatatcaatg aggaccttiag acgccgtcag 120

ggfcggctacg gtcgtggtgg tcgtatgaag attgagaatg accaggttgt ctttacttcg 180

ggcgttcgcc acgggaagac gacaggggcg cctattacta tggatgtcat caataaggac 240

caccagaaat ggctggacat catgtctgcg gaggacattg aagaccgcct taaaagcaag 300

cggaaaatta ctcatcctcg cccaggtcat gccgatttgg ttggggggat taiagtaccgt 360

tttgatgatt tgcgaaattc tttggagcgt tcatcagctc gtgaaaccac catgcgggtg 420

gcagttggtg cagtagccaa acgcctcttg gctgagctgg atatggagat tgccaaccat 480

gtcgtggtct ttggtggcaa ggaaatcgat gttcctgaaa atctgacagt cgcbgaaatt 540

aagcaaagag ctgcccagtc tgaagtttct attgtcaacc aagaacgaga acaggaaatc 600

aaggactata ttgaccaaat caaacgtgat ggtgatacca tcggtggggt tgtggagaca 660

gtcgtcggag gcgttccagt tggtcttggt tcctatgtcc aatgggatag aaaattggat 720

gcaagattgg ctcaagctgt tgtctctatc aatgccttta aaggggtgga atttggtctt 780

ggctttgagg ctggttatcg taaaggcagc caagttatgg atgaaattct ctggtctaaa 840

gaagacggtt atactcgccg taccaataat ctaggtggtt ttgaaggtgg tatgactaat 900

gggcaaccca tcgttgttcg tggggtcatg aaacccattc ctactcttta taaacctctt 960

atgagtgtgg atatcgaaac ccacgaacct tacaaggcaa ccgtggagag aagtgatccg 1020

actgctcttc cagctgcagg aatggtcatg gaagcagttg tagcaacggt tctggcgcaa 1080

gaaatcctcg ttctc atcagataat cttgaggaac taaaagaagc ggtagccaaa 1140

caccgagact atacaaagaa ctat 1164



SeqID 57

anggtagtta tgaatagaat aagagtcagc aaaagggttg aiaaagaagct tgctaagggg 60 

ctagttttac tagaagccag tgatcttgag aatgtcaatc ttaaggatca ggaagtagag 120 

gtgcagggtc aggaaggaaa ctttcttggg actgcctacc tttctcagca aaacaagggc 180 

ttgggctggt ttatcagcaa agacaaggtg gccttcaatc aagcttfcctt tgaaacgttg 240 

tttagawiaag ccaaagaaaa gagaaacgcc tactatcaag atgatttgac aactgccttt 300 

cgtctcttta atcaagaggg agatggcttt gggggtctga cagtggacct ttatggcgac 360

tacgccgtct tttcttggta taactcttat gtttatcaga ttcgtcagac tatatcagaa 420 

gcctttagac aggttttccc tgaggtttta ggagcttatg agaaaatccg ctttaagggt 480 

ttggactatg aatctgccca tgtttatggt caagaagcac ctgacttttt caatgtttta 540 

gaaaatggtg tcctgtatca agtctttatg aatgatggct tgatgacagg aattttccta 600 

gaccagcatg aggttcgcgg tagtttagtt gacggcttgg ctatgggtaa atccttactc 660 

aatatgtttt cctacacagc ggctttttca gtagctgcgg ccatgggagg agctagccat 720

acaacttctg ttgatctagc caaacgttca cgagaattgt ctcaagcgca ttttcaggca 780

aatgggctca gcacagacga gcatcgtttt atagtcatgg atgtctttga gtatttcaaa 840

tatgccaaac gcaaagactt gacctacgat gtgattgtcc tagatccgcc tagctttgct 900 

cggaataaaa aacaaacttt ctctgtggcc aaggattatc acaagttgat ttcccagagt 960 

cttgagattt taaatccggg agggattatc attgccagta ccaatgctgc caatgtttcc 1020 

cgtcagaaat ttacagaaca aattgataaa ggttttgcag gaagaagtta ccagatttta 1080

aacaaatatg gtcttccagc agattttgcc tataataaaa aagatgaaag tagtaattac 1140 

ctcaaggtga ttagtatgaa ggttagtaaa 1170 

SeqID 58

atgacaaaaa cattaaaacg tcctgaggtt ttatcacctg cagggacttt agagaagcta 60 

aaggtagctg ttcagtatgg agcagatgct gtctttatcg gtggtcaggc ctatggtctt 120 

cgtagccgtg cgggaaactt tactttcgaa cagatggaag aaggcgtgca gtttgcggcc 180 

aagtatggtg ccaaggtcta tgtagcggct aatatggtta tgcacgaagg aaatgaagct 240 

ggtgctggtg agtggttccg taaactgcgt gatatcggga ttgcagcagt tatcgtatct 300 

gacccagcct tgattatgat tgcagtgact gaagcaccag gccttgaaat ccacctttct 360 

acccaagcca gtgccactaa ctatgaaacc cttgagttct ggaaagagct aggcttgact 420 

cgtgtcgttt tagcgcgtga ggtttcaatg gaagaattag ctgagatccg caaacgtaca 480 

gatgttgaaa ttgaagcctt tgtccatgga gctatgtgta tttcatactc tggacgttgt 540 

actctttcaa accacatgag tatgcgtgat gccaaccgtg gtggatgttc tcagtcatgc 600 

cgttggaaat acgaccttta cgatatgcca tttgggaaag aacgtaagag tttgcagggt 660 

gagattccag aagaattttc aatgtcagcc gttgacatgt ctatgattga ccacattcca 720 

gatatgattg aaaatggtgt ggacagtcta aaaatcgaag gacgtatgaa gtctattcac 780 

tacgtatcaa cagtaaccaa ctgctacaag gcggctgtgg atgcctatct tgaaagtcct 840 

gasiaagtttg aagctatcaa acaagacttg gtggacgaga tgtggaaggt tgcccaacgt 900 

gaactggcta caggatttta ctatggtaca ccatctgaaa atgagcagtt gtttggtgct 960 

cgccgtaaaa ttcctgagta caagttfcgtc gctgaagtgg tttcttatga tgatgcggca 1020 

caaacagcaa caattcgtca acgaaatgtc attaacgaag gggaccaagt tgagttttat 1080 

ggtccaggtt tccgtcattt tgaaacctat attgaagatt tgcatgatgc caaaggcaat 1140 

aaaatcgacc gcgctccaaa tccaatggaa ctattgacta ttaaggtgcc tcaacccgtt 1200 

caatcaggag atatggttcg tgcattaaaa gaaggactca tcaatcttta taaggaagab 1260 

ggaaccagcg tcacagttcg agct 1284 

SeqID 59 

atgaatacct atcaattaaa taatggagta gaaattccag tattgggatt tggaactttt 60 

aaggctaagg atggagaaga agcctatcgt gcagtgttag aagccttgaa ggctggttat 120 

cgtcatattg atacggcggc gatttatcag aatgaagaaa gtgttggtca agcaatcaaa 180 

gatagcggag ttccacgtga agaaatgttc gtaactacca agctttggaa tagtcagcaa 240 

acctatgagc aaactcgtca agctttggaa aaatctatag aaaaactggg cttggattat 300 

ttggatttgt atttgattca ttggccgaac ccaaaaccgc tcagagaaaa tgacgcatgg 360 

aaaactcgca atgcggaagt ttggagagcg atggaagacc tctatcaaga agggaaaatc 420 

cgtgctatcg gcgttagcaa ttttcttccc catcatttgg atgccttgct tgaaactgca 480 

actatcgttc ctgcggtcaa tcaagttcgc ttggcgccag gtgtgtatca agatcaagtc 540 

gtagcttact gtcgtgaaaa gggaatttta ttggaagctt gggggccttt tggacaagga 600 

gaactgtttg atagcaagca agtccaagaa atagcagcaa atcacggaaa atcggttgct 660 

cagatagcct tggcctggag cttggcagaa ggatttttac cacttccaaa atctgtcaca 720 

acctctcgta ttcaagctaa tcttgattgc tttggaattg aactgagtca tgaggagaga 780 

gaaaccttaa aaacgattgc tgttcaatcg ggtgctccac gagttgatga tgtggatttc 840 

SeqID 60

ttgagtgaaa agtcaagaga agaagagaaa ttaagcttta aagagcagat tctgagagat 60 

ttagaaaaag taaaaggcta tgatgaagtt ctgaaagaag atgaggcagt agttcgcact 120 

cctgcaaatg aaccttcaac tgaagaactc atggctgatt ccttgtcaac ggtagaggag 180 

attatgagaa aagctcctac cgtgcctact cacccaagtc aaggtgtacc agcttctcca 240 

gcagatgaga ttcaaagaga aactcetggt gttccaagtc atccaagtca agatgtacct 300 

tcttctccag cggaagaaag tggatcaaga ccaggtccag gtcctgttag acctaagaaa 360 

cttgaaagag aatacaatga aaccccaaca agggtagctg tttcctatac gacggcagag 420 

aaaaaagcag aacaagcagg tccagaaaca cctacgcctg ctacagaaac agtggatatc 480 

atcagagata catcacgtcg tagccgtaga gaaggagcaa aacccgttaa gcctaagaaa 540 

gagaagaagt cacatgtgaa agcttttgtg atttcattcc ttgtattcct tgccttgctc 600 

tcagcaggtg gttactttgg ttaccagtac gtgctagatt ccttattacc tatcgatgct 660 

aattctaaga aatatgtgac ggttggaatt ccagaaggtt caaacgttca agaaatcggt 720 

acgacgcttg aaasLagctgg tttggtaaag catggtctga tttttagttt ttatgccaag 780 

tataaaaatt ataccgactt gaaagcaggt tactacaatt tgcaaaagag tatgagtaca 840 

gaagacttac tcaaagagtt gcaaaaaggt ggaacagatg aaccgcaaga acctgtactt 900 

gcgactttga caattccaga aggttatacc ttggatcaga ttgctcaagc tgtgggtcaa 960 

ttgcaaggtg acttcaaaga gtctttgaca gcggaggctt tcttggctaa agttcaagat 1020 

gagacgttta tcagtcaagc agtagcgaaa tatcctactt tactggaaag tttgcctgta 1080

aaagacagcg gtgcgcgtta tcgtttggaa ggataccttt tcccagctac atactctatc 1140

aaggaaagca caactattga gagcttgatt gatgagatgt tagctgctat ggataagaac 1200 

ctatctcctt actatagtac tatcaaatct aaaaacttga ctgtcaatga gttgttgacc 1260 

attgcttcct tggtcgaaaa agaaggtgcc aagacagaag atcgtaagct cattgcaggt 1320 

gtattctaca atcgtttgaa tcgtgatatg ccacttcaaa gtaatattgc aatcttgtat 1380 

gcccaaggaa aactggggca aaatatcagt ctagctgagg atgttgcgat tgataccaac 1440 

attgattcac cttataatgt ttataaaaat gtaggtctca tgcctggtcc agtcgatagt 1500 

ccsiagtctgg atgcgattga gtcaagcatc aatcaaacta agagcgataa cctctacttt 1560 

gtagcagatg tcacagaagg caaggtctac tatgctaaca atcaagaaga ccacgaccgc 1620 

aatgtcgctg aacatgtcaa cagcaaatta aac 1653 

SeqID 61

atgaaacatag aacgatttcc attggtgtca gatgacgagg tcatgttgac tgaeiatgcca 60 

gtcatgaatc tctatgatga gtctgatctg atcagtaata tcaagggtga gtatcgagat 120

aaaaattatt tagaatgggc tcctattgct gaagaaaaac cagtaaaacc gattgaaaag 180 

caagtcgaaa aacctaaaaa ggctccttta ggggttaaaa aagaaggaaa gagctatgcg 240

gaggtggcgc gtgaagaagc gcgtgcggac ttgasiaaaga aacgctctgc taacliaccta 300

actcaggatt tcagccttgc gagacgtcat tctcagccca gtctagttag acagggcaat 360 

caaccgacag ctcctttcca aaaggaaaat cctggtgaat ttgtcaaata tagccaaaaa 420 

ttgacccagt ctcattatat cttggcggaa gaagttcatt ctatccctac caagaatgaa 480 

gaagtgtcag cacctgctcc aaagsiaaaac aattatgatt ttctaiaagaa gagccaaatc 540 

tacaataaaa aaagtasiaca aacagaacaa gaacgtcggg ttgcccaaga gttgaatctg 600 

accagaatga cageia 615 

SeqID 62

atgaaaaagt ctsiagagcaa atatctaacc ttggcaggtc ttgtcctggg tacaggagtt 60 

ttattgagcg cgtgtiggaaa ttctagcacg gcgtcaaaaa cctacaacta tgtttattca 120

agtgatccat ctagcttgaa ctatctagca gaaaaccgcg cagcaacatc cgatattgtt 180 

gcaaatttgg tagacgggtt attagaaaat gaccaatatg ggaaHattat tccatcatta 240 

gcagaggatt ggactgtttc tcaggacggt ttgacctata cctacaaact tcgtaaggat 300 

gccaagtggt ttacttctga gggagaagaa tatgcgcctg taactgccca ggattttgtg 360 

acaggtttgc aatatgcagc tgataaaaaa tcagaagcct tgtatctagt gcaggactct 420 

gttgctggtt tggatgacta tatcactggt aaaacaagcg acttttcaac tgtcggtgtc 480

aaggcacttg atgaccaaac ggttceuitat actttggtta aaccagaact ttactggaat 540 

tcaaaaacac ttgcaacgat actttttcct gtteiatgcag atittcctgaa atcaaaaggg 600

gatgattttg ggaaggcgga tccatctagt attttgtaca atggaccttt cttgatgaaa 660

gcacttgtct caaaatctgc tattgaatat aagaaaaacc ctaattactg ggatgctaag 720 

aatgtctttg tagacgatgt gaaattgacc tactatgatg gtagcgacca agaatcactg 780 

gaacgtaatt ttacagctgg tgcttatact acggctcgtc tttttcctaa cagctccagc 840

tatgaaggga ttaaagaaaa atacaaaaac aatatcatct atagtatgca aaattcaact 900

tcatatttct ttaattttaa cctagatagg aagtcttaca attatacttc tataaacaagt: 960 

gacattgaaa agaaatcgac tcaggaagca gttctcaata aaaacttecg tcaggctatc 1020 

aatlititgctt ttgacagaac atcttatggg gctcagtctg aagggaiaaga aggtgcaaca 1080

aagattttgc gtaacctagt ggttcctcca aactttgtca gtatcaaggg aaaagacttt 1140

ggtgaagttg tagcctctaa gatggtcaac tatggtaagg aatggcaagg tatcaacttt 1200 

gcggatggtc aagaccctta ctacaatcct gagaaagcca aggctaagtt tgcggaagct 1260 

aagaaagaac tcgaagcaaa gggtgttcaa ttcccaatcc acttggataa gactgtggaa 1320 

gtatacagata aagtaggcat acaaggagtt agttctatca aacaatcaat tgaatctgtt 1380 

ttaggttctg ataiatgtagt gattgacatt cagcaattaa catcagatga gtttgacagt 1440 

tcaggctact ttgctcaaao agctgctcag aaagattatg atttatatca tggcggttgg 1500 

ggacctgatt atcaagaccc gtcaacctat ctcgatattt ttaatactaa tagtggagga 1560 

tttctgcaaa atcttggact agagcctggt gaggccaatg acaaggctaa ggcagttgga 1620 

ctggatgtct atactcaaat gttggaagaa gctaatasiag agcaagatcc ggccaaacgt 1680 

tatgagaaat atgctgatat tcaagcttgg ttgattgata gttctttagt tcttccaagt 1740 

gtttcgcgtg ggggaacacc atcattgaga agaaccgtac catttgctgc tgcctatggt 1800 

ttaaccggta caaaaggggt tgaatcatat aaatacctca aagtacaaga taagattgtc 1860 

acaacagacg aatatgcaaa agccagagaa aaatggttga aagaaaaaga agaatccaat 1920 

aaaaaagccc aagaagaatt ggcaaaacat gtcaaa 1956 

SeqID 63 

gtggaacagc attcagatgt ctgttacatt ttttatagga gagaaagatt gasiaacaaaa 60 

attggattag caagtatctg tttactaggc ttggcaacta gtcatgtcgc tgcaaatgaa 120

actgaagtag caaaaacttc gcaggataca acgacagctt caagtagttc agagcaaaat 180 

cagtcttcta ataaaacgca aacgagcgca gaagtacaga ctaatgctgc tgcccactgg 240 

gatggggatt attatgtaaa ggatgatggt tctaaagctc aaagtgaatg gatttttgac 300 

aactactata aggcttggtt ttatattaat tcagatggtc gttactcgca gaatgaatgg 360 

catggaaatt actacctgaa atcaggtgga tatatggccc aaaacgagtg gatctatgac 420 

agtaattaca agagttggtt ttatctcaag tcagatgggg cttatgctca tcaagaatgg 480 
caattgattg gaaataagtg gtactacttc aagaagtggg gttacatggc taaaagccaa 540

tggcaaggaa gttatttctt gaatggtcaa ggagctatga tgcaaaatga atggctctat 600 

gatccagcct attctgctta tttttatcta aaatccgatg gaacttatgc taaccaagag 660 

tggcaaaaag tgggcggcaa atggtactat ttcaagaagt ggggctatat ggctcggaat 720 

gagtggcaag gcaactacta tttgactgga agtggtgcca tggcgactga cgaagtgatt 780 

atggatggta ctcgctatat ctttgcggcc tctggtgagc tcaaagaaaa aaaagatttg 840 

aatgtcggct gggttcacag agatggtaag cgctatttct ttaataatag agaagaacaa 900 

gtgggaaccg aacatgctaa gaaagtcatt gatattagtg agcacaatgg tcgtatcaat 960 

gattggaaaa aggttattga tgagaacgaa gtggatggtg tcattgttcg tctaggttat 1020

agcggtaaag aagacaagga attggcgcat aacattaagg agttaaaccg tctgggaatt 1080 

ccttatggtg tctatctcta tacctatgct gaaaatgaga ccgatgctga gagtgacgct 1140

aaacagacca ttgaacttat suiagaaatac aatatgaacc tgtcttaccc tatctattat 1200

gatgttgaga attgggaata tgtaaataag agcaagagag ctccaagtga tacaggcact 1260

tgggttaaaa tcatcaacaa gtacatggac acgatgaagc aggcgggtta tcaaaatgtg 1320 

tatgtctata gctatcgtag tttattacag acgcgtttaa aacacccaga tattttaaaa 1380 

catgtaaact gggtagcggc ctatacgaat gctttagaat gggaaaaccc tcattattca 1440 

ggaaaaaaag gttggcaata tacctcttct gaatacatga aaggaatcca agggcgcgta 1500

gatgtcagcg tttggtat 1518 

SeqID 64

atggcaaaag aaccgtggca agaagatatc tatgatcaag aagaatcaag agcagagcgt 60 

cggcatcgaa accacggagg ggctgatagg atggctaatc gtattttgac gatcctagct 120

agtattttct ttgtaattgt ggtggtgatg gtcatcgttc tcatctatct atcatcgggg 180

gggagtaatc gcacagcagc cttaaaaggc tttcatgatt ctgatgccag tgtagtacaa 240 

atctcatctt caagtagttc tcagcctgag cagagttcag agccagaatc tacttctagt 300 

agttcagaag aagctgctaa tcctgaagga acgattaaag ttctcgcagg agaaggggaa 360 

gcagctattg ccgctcgtgc aggaatctcc attgctcagt tagaggcctt gaatcctggg 420

cacatggcta caggatcttg gtttgctaat ccaggtgatg ttataaaaat aaaa 474

SeqID 65

atgccaatta catcattaga aataaaggac aagacttttg gaactcgatt cagaggtttt 60

gatccagaag aagtcgatga atttttagat attgtggttc gtgattacga agatcttgtg 120 

cgtgcgaatc atgatastaaa tttgcgtatt aagagtttag aagagcgttt gtcttacttt 180 

gatgaaataa aagattcatt gagccagtct gtattgattg ctcaggatac agctgagaga 240 

gtgaaacagg cggcgcatga acgttcaaac aatatcattc atcaagcaga gcaagatgcg 300 

caacgcttgt tggaagaagc taaatataag gcaaacgaga ttcttcgtca agcaactgat 360

aatgctaaga aiagtcgctgt tgaaacagaa gaiattgaaga acaagagccg tgtcttccac 420 

caacgtctca aatctacaat tgagagtcag ttggctattg ttgaatcttc agattgggaa 480

gatattctcc gtccaacagc tacttatctt caaaccagtg atgaagcctt taaagaagtg 540 

gttagcgaag tacttggaga accgattcca gctccaattg aagaagaacc aattgatatg 600

acacgtcagt tctctcaagc agaaatggca gaattacaag ctcgtattga ggtagccgat 660

aaagaattgt ctgaatttga agctcagatt aaacaggaag tggaagctcc aactcctgta 720

gtgagtcctc aagttgaaga agagcctctg ctcatccagt tggcccaatg tatgaagaac 780

cagaag 786 

SeqID 66 

atgtctttaa aagatagatt cgatagattt atagattatt ttacggagga tgaggattca 60

agtctccctt atgsiaaaaag agatgagcct gtgtttactt cagtaaattc ttcacaggaa 120

ccggctctcc caatgaatca accttcacag tcggctggca caaaagagaa caatatcacc 180

agacttcatg csiagacaaca ggaattggca aatcagagtc agcgtgcaac ggataaggtc 240

attatagatg ttcgttatcc tagaaaatat gaggatgcaa cagaaattgt tgatttattg 300 

gcaggaaacg aaagtatctt gattgatttt cagtatatga cagaggtgca ggctcgtcgt 360

tgtttggact atttggatgg agcttgtcat gttttagctg gaaatttgaa aaaggtagct 420 

tctaccatgt atttgttgac accagtgaac gttattgtcia atgttgaaga tatccgttta 480

ccagatgaag atcaacaggg tgagttoggt tttgatatga agcgaaatag agtacga 537

SeqID 67 

atgtcagatt tgaaaaaata cgaaggtgtc attccagcct tctacgcatg ttatgatgat 60

caaggagaag taagcccaga acgtacgcgt gccttggttc aatacttcat tgataaaggt 120

gttcaaggtc tttatgtcaa tggttcttct ggtgaatgta tctaccaaag cgttgaagat 180

cgcaagttga ttttggaaga agtcatggcg gtagccaaag gtaaattgac cattattgcc 240 

catgttgctt gcaataatac taaagatagt atggaacttg ctcgccatgc tgaaagcttg 300 

ggagtagatg ctattgcaac gattccacca atttatttcc gcttgccaga atactcagtt 360

gccaaatact ggaacgatat cagttctgca gctccaaaca cagactacgt gatttacaac 420

attcctcaat tggcaggggt tgctttgact ccaagccttt acacagaaat gttgaaaaat 480 

cctcgtgtta tcggtgtgaa gaactcttct atgccagttc aagatatcca aacctttgtc 540

agccttggtg gagaagacca tatcgtcttt aatggtcctg atgagcagtt cctaggagga 600 

cgcctcatgg gggctagggc tggtatcggt ggtacttatg gtgctatgcc agaactcttc 660
ttgaaactca atcagttgat tgcggataag gacctagaaa cagogcgtga attgcagtat 720

gctatcaacg caatcattgg taaactcact tctgctcatg gaaatatgta cggtgtcatc 780 

aaagaagtct tgaaaatcaa tgstaggcttg aatattggat ctgttcgttc accattgaca 840 

ccagtgac^g aagaagatcg tccagttgta gaagcggctg ctgccttgat tcgtgaaacc 900 

aaggagcgct tcctc 915 

SeqID 68

atgaataaaa gaggtcttta ttcaaaacta ggaatttccg ttgtaggcat tagtctttta 60 

atgggagtcc ccactttgat tcatgcgaat gaattaaact atggtcaact gtccatatcH 120 

cctatttttc aaggaggttc atatceiactg aacetataiaga gtatagatiat: cagctctttg 180 

ttattagata aattgtctgg agagagtcag acagtagtiaa tgaaatttaa agcagataaa 240 

ccaaactctc ttcaagcttt gtttggccta tctaatagta aagcaggctt taaaaataiat 300 

tacttttcaa ttttcatgag agattctggt gagataggtg tagaaatEiag agacgcccaa 360 

^9ggaataa attatttatt ttccagacca gcttcattat ggggaaaaca taaaggacag 420 

gcagttgaaa atacactagt atttgtatct gattcteiaag ataaaacata cacaatgtat 480 

gttaatggeia tagaagtgtt ctctgaaaca gttgatacat ttttgccaat ttcsuiatata 540 

aatggtatag ataaggcaac actaggagct gttaatcgtg aaggtaagga acattacctc 600 

gcaaaaggaa gtattgatga aatcagtcta tttaacaaag caattagtga tcaggaagtt 660 

tcsiactattc ccttgtcaaa tccatttcag ttaattttcc ciatcaggaga ttctactcaa 720 

gctaactatt ttagaatacc gacactatat acattaagta gtggaagagt tctatcaagt 780 

attgatgcac gttatggtgg gactcatgat tctaaiaagta agatteuttat tgccacttct 840 

tatagegatg ataatgggaa aacgtggagt gagccaattt ttgctatgcia gtttutgac 900 

batgaggagc agttagttta ctggccacga gataataaat taaagaatag tcaaattagt 960 

ggaiagt.gctt cattcataga ttcatccatt gttgaagata aaaaatctgg gaaaacgata 1020 

ttactagctg atgttatgcc tgcgggtatt ggaaataata atgcaaataa agccgactca 1080 

ggttttaaag aaataaatgg tcatitattat ttaaaactaa agaagaatgg agatiaacgat 1140

ttccgttata cagttagaga aaatggtgtc gtttataatg aaacaactaa taaacctaca 1200

aattatacta taaatgataa gtatgeiagtt ttggagggag gaaiagtcttt aacagtcgaa 1260 

caatattcgg ttgattttga tagtggctct ttaagagaaa ggcataatgg aaaacaggtt 1320 

cctatgaatg ttttctacaa agattcgtta tttaaagtga ctcctactaa ttatatiagca 1380 

atgacaacta gtcagaatag aggagagagt tgggaacaat ttaagttgtt gcctccgttc 1440 

ttaggagaaa aacataatgg aacttactta tgtcccggac aaggtttagc attaaaatca 1500 

agtaacagat tgatttttgc aacatatact agtggagaac taacctatct catttctgat 1560 

gatagtggtc aaacatggaa gaaatcctca gcttcaattc cgtttaaaaa tgcaacagca 1620 

gaagcacaaa tggttgaact gagagatggt gtgattagaa cattctttag aaccactaca 1680 

ggtaagatag cttatatgac tagtagagat tctggagaaa catggtcgaa agtttcgtat 1740

attgatggaa tccaacaaac ttcatatggc acacaagtat ctgcaattaa atactcucaa 1800

ttaattgatg gaaaagaagc agtcattttg agtacaccaa attctagaag tggccgcaag 1860 

ggaggccaat tagttgtcgg tttagtcaat aaagaagatg atagtattga ttggaaatac 1920

cactatgata ttgatttgcc ttcgtatggt tatgcctatt ctgcgattac agaattgcca 1980 

aatcaticaca taggtgtact gtttgaaaaa tatgattcgt ggtcgagasta tgaattgcat 2040 

ttaagcaatg tagttcagta tatagatttg gaaattaatg atttaacaaa a 2091 

SeqID 69

atgaatcgga gtgttcaaga acgtaagtgt cgttatagca ttaggaaact atcggtagga 60 

gcggtttcta tgattgtiagg agcagtggtia tttggaacgt ctcctgtttt agctcaagaa 120 

ggggcaagtg agcaacctct ggcaaatgaa actcaacttt cgggggagag ctcaacccta 180 

actgatacag aaaagagcca gccttcttca gagactgaac tttctggcaa taagcaagaa 240 

caagaaagga aagataagca agaagaaaaa attccaagag attactatgc acgagatttg 300 

gaaaatgtcg aaacagtgat agaaaaagaa gatgttgaaa ccaatgcttc aaatggtcag 360 

agagttgatt tatcaagtga actagataaa ctaaagaaac ttgaaaacgc aacagttcac 420 

atggagttta agccagatgc caaggcccca gcattctata atctcttttc tgtgtcaagt 480 

gctactaaaa aagatgagta cttcactatg gcagtttaca ataatactgc tactctagag 540 

gggcgtggtt cggatgggaa acagttttac aiataattaca acgatgcacc cttaaaagtt 600 

aaaccaggtc agtggaattc tgtgactttc acagttgaaa aaccgacagc agaactacct 660 

aaaggccgag tgcgcctcta cgtaaacggg gtattatctc gaacaagtct gagatctggc 720 

aatttcatta aagatatgcc agatgtaacg catgtgcaaa tcggagcaac caagcgtgcc 780 

aacaatacgg tttgggggtc aaatctacag attcggaatc tcactgtgta taatcgtgct 840 

ttaacaccag aagaggtiaca aiaaacgtagt caacttttta aacgctcaga tttagaaaaa 900 

aaactacctg aaggagcggc tttaacagag aaaacggaca tattcgaaag cgggcgtaac 960 

ggtaacccaa ataaagatgg aatcaagagt tatcgtattc cagcacttct caagacagat 1020 

aaaggaactt tgatcgcagg tgcagatgaa cgccgtctcc attcgagtga ctggggtgat 1080 

atcggtatgg tcatcagacg tagtgaagat aatggtaaaa cttggggtga ccgagtaacc 1140 

attaccaact tacgtgacaa tccaaaagct tctgacccat cgatcggttc accagtgaat 1200 

atcgabatgg tgttggttca agatcctgaa accaasusgaa tcttttctat ctatgacatg 1260 

ttcccagaag ggaagggaat ctttggaatg tcttcacaaa aagaagaagc ctacaaaaaa 1320 

atcgatggaa aaacctatca aatcctctac cgtgaaggag aaaagggagc ttataccatt 1380 

cgagaaaatg gtactgtcta tacaccagat ggtaaggcga cagactatcg cgttgttgta 1440 
gatcctgtta aaccagccta tagcgacaag ggtgatctat acaagggtga ccaattacta 1500

ggaaatatct acttcacaac aaacaaaact tctccattta gaattgccaa ggatagctat 1560 

ctatggatgt cctacagtga tgacgacggg aagacatggt cagctcctca agatattact 1620 

ccgatggtca aagccgattg gatgaaattc ttgggtgtag gtcctggaac aggaattgta 1680 

cttcggaatg ggcctcaccLa gggacggatt ttgataccgg tttatacgac taataatgta 1740 

tctcacttag atggctcgca atcttctcgt gtcatctatt cagatgatca tggaaaaact 1800 

tggcatgctg gagaagcggt caacgataac cgtcaggtag acggtcaaaa gatccactct 1860

tctacgatga acaatagacg tgcgcaaaat acagaatcaa cggtggtaca actaaacaat 1920 

ggagatgtta aactctttat gcgtggtttg actggagatc ttcaggttgc tacaagtaaa 1980 

gacggaggag tgacttggga gaaggatatc aaacgttatc cacaggttaa agatgtctat 2040 

gttcaaatgt ctgctatcca tacgatgcac gaaggaaaag aatacatcat cctcagtaat 2100 

gcaggtggac cgaaacgtga aaatgggatg gtccacttgg cacgtgtcga agaaaatggt 2160 

gagttgactt ggctcaaaca caatccaatt caaaaaggag agtttgccta taattcgctc 2220 

caagaattag gaaatgggga gtatggcatc ttgtatgaac atactgaaaa aggacaaaat 2280 

gcctataccc tatcatttag aaaatttaat tgggaatttt tgagcaaaaa tctgatttct 2340 

cctaccgaag cgaactagag agatgggcaa aggagagatg ggcaaaggag ttattggctt 2400 

ggagttcgac tcagaagtat tggtcaacaa ggctccaacc cttcaattgg caaatggtaa 2460 

aacagcgact ttcctaaccc agtatgatag caagaccttg ttgtttgcag tagataagga 2520 

agatatcgga caggaaatta ttggtatagc taaaggaagc atcgaaagta tgcataatct 2580 

tcctgtaaat ctagcaggtg ccagagttcc tggcggagta aatggtagca aagcagcggt 2640 

gcatgaagtt ccagaattta cagggggagt taatggtaca gagccagctg ttcatgaaat 2700 

cgcagagtat aagggatctg attcgcttgt aactcttact acaaaaaaag attatactta 2760 

caaagctccfc cttgctcagc aggcacttcc tgaaacagga aacaaggaga gtgacctcct 2820 

agcttcacta ggactaacag ctttcttcct tggtctgttt acgctaggga aaaagagaga 2880 

acaa 2884

SeqID 70 

atgatccaaa tcggcaagat ttttgccgga cgctatcgga ttgtcsiaaca gattggtcga 60 

ggaggtatgg cggatgtcta cctagccaaa gacttaatct tagatgggga agaagtggca 120 

gtgaaggttc tgaggaccaa ctaccagacg gacccgatag ctgtagctcg ttttcagcgt 180 

gaagcgagag ctatggcaga tctagaccat cctcatatcg ttcggataac agatattggc 240 

gaggaagacg gtcaacagta cctagctatg gagtatgtgg ctggactgga cctcaaacgc 300 

tatatcaagg aacattatcc tctttctaat gaagaagcag tccgtatcat gggacaaatt 360 

ctcttggcta tgcgcttggc ccatactcga ggaattgttc acagggactt gaaacctcaa 420 

aatatcctct tgacaccaga tgggactgcc aaggtcacag actttgggat tgctgtagcc 480 

tttgcagaga caagtctgac ccagactaac tcgatgttgg gctcagttca ttacttgtca 540 

ccagagcagg cgcgtggttc gaaggcgact gtgcagagtg atatctatgc catggggatt 600 

attttctatg agatgctgac aggccatatc ccttatgacg gggatagcgc ggtgaccatt 660 

gccctccagc atttccagaa acccctgccg tccgttattg cagaasLatcc atctgtacct 720 

caggctttag aaaatgttat tatcaaggca actgctaaaa agttgaccaa tcgctaccgc 780 

tcggtttcag agatgtatgt ggacttgtct agtagcttgt cctacaatcg tagaaatgaa 840 

agtaagttaa tctttgatga aacgagcaag gcagatacca agaccttgcc gaaggtttct 900 

cagagtacct tgacatctat tcctaaggtt caagcgcaaa cagaacacaa atcaatcaaa 960 

aacccaagcc aggctgtgac agaggaaact taccaaccac aagcaccgaa aaiaacataga 1020 

tttaagatgc gttacctgat tttgttggcc agccttgtat tggtggcagc ttctcttatt 1080 

tggatactat ccagaactcc tgcaaccatt gccattccag atgtggcagg tcagacagtt 1140 

gcagaggcca aggcaacgct caaaaaagcc aattttgaga ttggtgagga gaagacagag 1200 

gctagtgaaa aggtggaaga agggcggatt atccgtacag atcctggcgc tggaactggt 1260 

cgaaaagaag gaacgaaaat caatttggtt gtctcatcag gcaagcaatc tttccaaatt 1320 

agtaattatg tcggtcggaa atcctctgat gtcattgcgg aattaaaaga gaaaaaagtt 1380 

ccagataatt tgattaaaat tgaggaagaa gagtcgaatg agagtgaggc tggaacggtc 1440 

ctgaagcaaa gtctaccaga aggtacgacc tatgacttga gcaaggcaac tcaaattgtt 1500 

ttgacagtag ctaaaaaagc tacgacgatt caattaggga actatattgg acggaactct 1560 

acagaagtaa tctcagaact caagcagaag aaggttcctg agaatttgat taagatagag 1620 

gaagaagagt ccagcgaaag cgaaccagga acgattatga aacaaagtcc aggtgccgga 1680

acgacttatg atgtgagtaa acctactcaa attgtcttga cagtagctaa aaaagttaca 1740 

agtgttgcca tgccgagtta cattggttct agcttggagt ttactaagaa caatttgatt 1800 

caaattgttg ggattaagga agctaatata gaagttgtag aagtgacgac agcgcctgca 1860 

ggtagtgcag aaggcatggt tgttgaacaa agtcctagag caggtgaaaa ggtagacctc 1920 

aataagacta gagtcaagat ttcaatctac aaacctaaaa caacttcagc tactcct 1977

SeqID 71

atgacaaaac taatctttat ggggaccccc gacttttcag caacagtctt aaaaggactt 60 

ttgacagatg accgttacga aattctagcc gttgtgaccc agccagaccg tgctgttggt 120 

cgtaaaaaag btatccaaga aaccccagtc aagcaggctg ccaaggaagc aggactatct 180

atctaccaac ctgaaaaatt atctggaagt ccagagatgg aagatcttat gaagctagga 240 

gcagatggcia ttgtgactgc tgcttttggg cagtttctcc caagcaaact ccttgatagc 300 

atggactttg ctgtcaacgt tcatgcctcc ctccttccta gacaccgtgg tggtgcgcct 360 
atccattatg ccttgattca aggggatgag gaagctggtg tgaccatcat ggaaatggtt 420

aaggaaatgg atgcaggaga tatgatttct cgtcgcagca ttccgatcac agatgaggac 480 

aatgttggca ccttgtttga aaaattggcg ctagttggtc gtgatttgct tttggacact 540 

ctgcctgcct atattgctgg tgatatcaaa cctgaaccgc aggatacgag tcaggttacc 600 

ttctctccaa atataaagcc agaggaagaa aaactggact ggaacaaaac caatcgtcaa 660 

ctctttaacc aaattcgtgg aatgaacccc tggcctgttg cccatacttt ccttaagggc 720 

gaccgcttta agatttatga agccctacca gtagaaggtc agggaaatcc aggtgagatt 780 

ctctctatcg gcaagaaaga attgattgtc gcaacggctg aaggggctct atccctcaaa 840 

caagtgcagc cagctggtaa gcctaagatg gacattgctt ccttcctcaa cggagttgga 900 

cgtacattga ctgtaggaga acgatttggt gac 933 

SeqID 72

gtgtttagac gtttaggtca agatttccag cttaggaaag tgaaaaagat tttaaagcag 60 

attaatgccc tgaaaggcaa gatgtcctct ctttcggatc aagaattagt agctaaaaca 120 

gtagagtttc gtcagcgtct ttccgaggga gaaagtctag acgatatttt ggttgaagct 180 

tttgctgtgg tgcgtgaagc agataagcgg attttaggga tgtttcctta tgatgttcaa 240 

gtcatgggag ctattgtcat gcactatgga aatgttgctg agatgaatac SSSSS^SS^ 300 

aagaccttga cagctaccat gcctgtctat ttgaacgctt tttcaggaga aggagtgatg 360 

gttgtgactc ctaatgagta tttatcaaag cgtgatgccg aggaaatggg tcaagtttat 420 

cgttttctag gattgaccat tggtgtacca tttacggaag atccaaagaa ggagatgaaa 480 

gctgaagaaa agaagcttat ctatgcttcg gatatcatct acacaaccaa tagtaattta 540 

ggttttgatt atctaaatga taacctagcc tcgaatgaag aaggtaagtt tittacgaccg 600 

tttaactatg tgattattga tgaaattgat gatatcttgc ttgatagtgc acaaactcct 660 

ctgattattg cgggttctcc tcgtgttcag tctaattact atgcgatcat tgatacactt 720 

gtaacaacct tggtcgaagg agaggattat atctttaaag aggagaaaga ggaggtttgg 780 

ctcactacta agggggccaa gtctgctgag aatttcctag ggattgataa tittatacaag 840 

gaagagcatg cgtcttttgc tcgtcatttg gtttatgcga ttcgagctca taagctcttt 900 

actaaagata aggactatat cattcgtgga aatgagatgg tactggttga taagggaaca 960

gggcgtctaa tggaaatgac taaacttcaa ggaggtctcc atcaggctat tgaagccaag 1020 

gaacatgtca aattatctcc tgagacgcgg gctatggcct cgatcaccta tcagagtctt 1080 

tttaagatgt ttaataagat atctggtatg acagggacag gtaaggtcgc ggaaaaagag 1140

tttattgaaa cttacaatat gtctgtagta cgcattccaa ccaatcgtcc gagacaacgg 1200 

attgactatc cagataatct atatatcact ttacctgaaa aagtgtatgc atccttggag 1260 

tacatcaagc aataccatgc taagggaaat cctttactcg tttttgtagg ctcagttgaa 1320 

atgtctcaac tctattcgtc tctcttgttt cgtgaaggga ttgcccataa tgtcctaaat 1380 

gctaataatg cggcgcgtga ggctcagatt atctccgagt caggtcagat gggggctgtg 1440 

acagtggcta cctctatggc aggacgtggt acggatatca agcttggtaa aggagtcgca 1500 

gagcttgggg gcttgattgt tattgggact gagcggatgg aaagtcagcg gatcgaccta 1560 

caaattcgtg gccgttctgg tcgtcaggga gatcctggta tgagtaaatt ttttgtatcc 1620 

ttagaggatg atgttatcaa gaaatttggt ccatcttggg tgcataaaaa gtacaaagac 1680 

tatcaggttc aagatatgac tcaaccggaa gtattgaaag gtcgtaaata ccggaaacta 1740 

gtcgaaaagg ctcagcatgc cagtgatagt gctggacgtt cagcacgtcg tcagactctg 1800 

gagtatgctg aaagtatigaa tatacaacgg gatatagtct ataaagagag aaatcgtcta 1860 

atagatggtt ctcgtgactt agaggatgtt gttgtggata tcattgagag atatacagaa 1920 

gaggtagcgg ctgatcacta tgctagtcgt gaattattgt ttcactttat tgtgaccaat 1980 

attagttttc atgttaaaga ggttccagat tatatagatg taactgacaa aactgcagtt 2040 

cgtagcttta tgaagcaggt gattgataaa gaactttctg aaaagaaaga attacttaat 2100 

caacatgact tatatgaaca gtttttacga ctttcactgc ttaaagccat tgatgacaac 2160 

tgggtagagc aggtagacba tctacaacag ctatccatgg ctatcggtgg tcaatctgct 2220 

agtcagaaaa atccaatcgt agagtactat caagaagcct acgcgggctt tgaagctatg 2280

aaagaacaga tfccatgcgga tatggtgcgt aatctcctga tggggctggt tgaggtcact 2340 

ccaaaaggbg aaatcgtgac tcattttcca 2370 

SeqID 73 

atgaccgaaa cggtagaaga taaagtaagt cattcaatta ctgggcttga tatcctcaag 60 

gggatagttg ctgcgggagc tgtcataagt ggaaccgttg caactcaaac gaaggtattt 120 

acaaatgagt cagcagtact tgaaaaaact gtagagaiaaa cggatgcttt ggcaacaaat 180 

gatacagtag ttctaggtac gatatctaca agtaattcag cgagttcaac tagtttgtca 240 

gcttcagagt cggcaagtac atctgcatct gagtcagcct caaccagcgc ttcgacctca 300 

gcaagtacaa gtgcatcaga atcagcaagt acatcggctt cgacaagtat ttctgcatca 360 

tctactgtgg taggttcaca aacagctgcc gctacagaag caactgctaa gaaggtcgaa 420 

gaagatcgta agaaaccagc tagtgattat gtagcatcag ttacaaatgt caatctccaa 480 

tcttatgcta agcgacgcaa gcgttcagtg gattccatcg agcciattgct ggcttctata 540 

aaaaatgctg ctgttttttc tggcaatacg attgtaaatg gcgcccctgc aattaatgca 600 

agtctaaaca ttgctaaaag tgagacaaaa gtttatacag gtgaaggtgt agattcggta 660

catcgtgttc caatttacta taaattgaaa gtgacaaatg atggtccaaa attgaccttt 720

acctatacgg ttacgtatgt gaatcctaaa acaaatgatc ttggtaatat atcaagtatg 780

cgtcctggat attctatcta taattcaggt acttcaacac aaacaiatgtt. aacccttggc 840 
agtgatcttg gtaaaccttc aggtgtaaag aactacatta ctgacaaaaa tggtagacag 900
gttctatcct ataatacatc tacaatgacg acgcagggta gtgggtatac ttggggaaat 960 

ggtgcccaaa tgaatggttt ctttgctaag aaaggatatg gattaacatc atcttggact 1020 

gtaccaatta ctggaacgga tacatccttt acatttaccc cttacgctgc tagaacagat 1080 

agaattggaa ttaactactt caatggtgga ggaaaggtag ttgaatctag cacgaccagt 1140 

cagtcacttt cacagtctaa gtcactctca gtaagtgcta gtcaaagcgc ctcagcttca 1200 

gcatcaacaa gtgcgtcggc ttcagcatca accagtgcct cggcttcagc gtcaaccagt 1260 

gcgtcagctt cagcaagtac cagtgcttca gtctcagcat caacaagtgc ttcagcctca 1320 

gcatcgacaa gtgcctcggc ttcagcaagc acatcagcat ctgaatcagc gtcaaccagt 1380 

gcttcggctt cagcaagtac cagtgcttca gcttcagcat caaccagcgc ctcggcctca 1440 

gcaagcacct cagcttctga atcggcctca accagcgcct cggcctcagc aagcacctca 1500 

gcttctgaat cggcctcaac cagcgcctca gcctcagcat caacgagtgc ttcggcttca 1560 

gcaagcacaa gcgcctcggg ttcagcatca acgagtacgt cagcttcagc gtcaaccagt 1620 

gcttcagcct cagcatcaac aagtgcgtca gcctcagcaa gtatctcagc gtctgaatcg 1680 

gcatcaacga gtgcgtctga gtcagcatca acgagtacgt cagcctcagc aagcacctca 1740 

gcttctgaat cggcctcaac cagtgcgtca gcctcagcat cgacaagcgc ctcagcttca 1800

gcaagtacca gtgcttcagc ctcagcgtcg acaagtgcgt cggcctcaac cagtgcatct 1860

gaatcggcat caaccagtgc gtcagcctca gcaagtacta gtgcatcggc ttcagcatca 1920 

accagtgcct cggcttcagc gtcaaccagt gcgtcagctt cagcaagtac cagtgcttca 1980 

gtctcagcat caacaagtgc ttcagcctca gcatcgacaa gtgcctcggc ttcagcaagc 2040

acatcagcat ctgaatcagc gtcgacaagc gcctcagctt cagcaagtac cagtgcgtca 2100 

gcctcagcgt cgacaagtgc gtcagcctca gcaagtacta gtgcatcagc ttcagcatca 2160 

acgagtgcat cggcttcggc gtcaaccagt gcatcagagt cagcaagtac cagtgcgtca 2220 

gcttccgcat caacaagtgc ctcggcttca gcaagcacca gtgcgtcggc ttcagcaagt 2280 

actagcgcct cagcctcagc ctcaaccagt gcgtcagcct cagcaagtat ctcagcgtct 2340 

gaatcggcat caacgagtgc gtccgcttca gcaagtacta gcgcctcagc ctcagcgtca 2400 

acaagtgcat cggcttcagc gtcaacgagt gcgtctgaat cggcatcaac gagtgcgtcc 2460 

gcttcagcaa gtactagcgc ctcagcctca gcgtcaacaa gtgcatcggc ttcagcatca 2520 

acgagtgcgt ccgcttcagc aagtactagc gcctcagcct cagcgtcaac aagtgcatcg 2580 

gcttcagcgt caacgagtgc gtctgagtca gcatcaacga gtgcgtcagc ctcagcaagc 2640 

acatcagctt ctgaatctgc atcaaccagt gcgtcagcct cagcatcgac aagcgcctca 2700 

gcttcagcaa gtaccagtgc gtcagcctca gcgtcgacaa gtgcgtcggc ttcagcaagt 2760 

accagtgcgt cagcctcagc aagtaccagt gcgtcagcct cagcgtcgac aagtgcgtcg 2820 

gcctcaacca gtgcatctga atcggcatca accagtgcgt cagcctcagc aagtactagt 2880 

gcatcagctt cagcatcaac gagtgcatcg gcttcagcat caaccagtgc atcagagtca 2940 

gcaagtacca gtgcgtcagc ttccgcatca acaagtgcct cggcttcagc aagtactagc 3000 

gcctcagcct cagcgtcaac aagtgcttca gcttccgcgt caaccagcgc ctcggcctca 3060 

gcaagtatct cagcgtctga atcggcatca acaagtgcct cggcttcagc atcaacgagt 3120 

gcatcagtct cagcaagcac cagtgcgtcg gcctcagcaa gcaccagcgc gtctgaatcc 3180 

gcatcaacca gtgcctcagc ttcagcaagt acctcagcat ctgaatcagc atcaacaagt 3240 

gcctcggctt cagcaagcac aagtgcttca gcctcagcaa gtatctcagc gtctgaatcg 3300 

gcatcaacga gtgcgtccgc ttcagcaagt actagcgcct cagcatcagc gtcaacaagt 3360 

gcttcggctt cagcgtcaac gagtgcgtct gagtcagcat caacgagtac gtcagcctca 3420 

gcaagcacat cagcttctga atctgcatca accagtgcgt cagcctcagc atcgacaagc 3480 

gcctcagctt cagcaagtac cagtgcgtca gcctcagcaa gtaccagtgc ttcagcctca 3540 

gcgtcgacaa gtgcgtcggc ctcaaccagt gcatctgaat cggcatcaac cagtgcgtca 3600 

gcctcagcaa gtactagcgc ctcagcctca gcatcaacga gtgcgtccgc ttcagcaagt 3660 

actagtgcat cagcttcagc aagtactagc gcctcagcct cagcgtcgac aagcgcctca 3720 

gcttcagcaa gtaccagtgc gtcagcctca gcgtcgacaa gtgcgtcggc ttcagcaagt 3780 

acctcagcgt ctgaatcagc atcaacaagt gcgtcggctt cagcatcaac gagtgcatca 3840 

gcttcagcat caacaagtgc ttcagcttca gcaagtacca gtgcgtcggc ttcagcatca 3900

acgagtgctt cagtctcagc gtcaaccagt gcctctgaat ccgcatcaac aagtgcctcg 3960 

gcttcagcaa gcaccagtgc ttcggcttca gcgtcaacga gtgcgtctga gtcagcatca 4020 

acgagtgcgt cagcctcagc aagcacatca gcttctgaat ctgcatcaac cagtgcgtca 4080 

gcttccgcat caacaagcgc ctcggcctca gcaagtacaa gtgcttcagc ctcagcatca 4140 

accagtgcat cagcttcagc ctcaacaagt gcttcagcct cagcgtcaac cagtgcctcg 4200 

gcttcagcaa gtaccagtgc gtcagcttca gcaagcacaa gtgcgtcagc ttcagcatca 4260 

accagtgctt cggcttcggc atcaacaagt gcctcagcat cagcatcaac gagtgcgtca 4320 

gcctcagcaa gtactagtgc atcagcatca gcatcaacca gtgcatcagc ctcagcaagt 4380 

atctcagcgt ctgaatcggc atcaacgagt gcatcagcat cagcatcaac gagtgcatcg 4440 

gcttcagcgt caaccagtgc atcagtctca gcaagcacca gtgcgtcggc ttcagcatca 4500 

acgagtgcct cagcctcagc aagtatctca gcgtctgaat cggcatcaac gagtgcgtca 4560 

gcctcagcaa gtactagtgc atcggcttca gcaagcacca gtgcgtcggc ttcagcatca 4620 

accagtgcct cagcctcagc aagtatctca gcgtctgaat cggcatcaac gagtgcgtca 4680 

gcctcagcaa gtactagtgc atcagcatca gcatcaacga gtgcatcggc ttcagcaagt 4740

accagcgcct cagcttcagc aagcaccagt gcgtcagcct cagcaagtac cagcgcctca 4800 

gcctcagcaa gcaccagtgc ctcagcttca gcaagtacca gtgcgtcagc ctcagcgtcg 4860 

acaagtgcgt cggcttcagc aagtacctca gcgtctgaat cagcatcaac gagtgcatca 4920 
gcttcagcat caacaagtgc ttcagcttca gcaagtacca gtgcgtcggc ttcagcatca 4980

acgagtgctt cagtctcagc gtcaaccagt gcctctgaat cagcatcaac aagtgcctcg 5040

gcttcagcaa gcaccagtgc gtcggcttca gcaagtacta gtgcatcggc ttcagcatcg 5100 

acaagtgcgt ctgaatcggc atcaacgagt gcttcggctt cagcatcaac gagtgcgtca 5160 

gcctcagcaa gcacatcagc ttctgaatct gcatcaacca gtgcgtccgc ttcagcgtca 5220 

accagtgcgt cggcttcagc gtcgacaagt gcttcggctt cagcatcaac gagtgcgtcg 5280

gcctcagcaa gcgcaagtac ctcagcgtca gcttccgcct caaccagtgc gtcggcttca 5340 

gcaagcacaa gtgcgtcagc ctcagcaagt atctcagcgt ctgaatcggc atcaacgagt 5400 

gcgtctgagt cagcatcaac gagtacgtca gcctcagcaa gcacatcagc ttctgaatct 5460 

gcatcaacca gtgcgtcagc ctcagcatcg acaagcgcct cagcttcagc aagtaccagt 5520 

gcttcagcct cagcgtcgac aagtgcgtcg gcctcaacca gtgcatctga atcggcatca 5580 

accagtgcgt cagcctcagc aagtactagt gcatcagctt cagcatcaac gagtgcatcg 5640

gcttcagcat caaccagtgc ctcggcttca gcgtcaacca gtgcgtcagc ttcagcaagt 5700 

accagtgctt cagtctcagc atcaacaagt gcttcagcct cagcatcgac aagtgcctcg 5760 

gcttcagcaa gcacatcagc atctgaatca gcgtcgacaa gcgcctcagc ttcagcaagt 5820 

accagtgcgt cagcctcagc gtcgacaagt gcgtcagcct cagcaagtac tagtgcatca 5880 

gcttcagcat caacgagtgc atcggcttcg gcgtcaacca gtgcatcaga gtcagcaagt 5940

accagtgcgt cagcttccgc atcaacaagt gcctcggctt cagcaagcac cagtgcgtcg 6000

gcttcagcaa gtactagcgc ctcagcctca gcctcaacca gtgcgtcagc ctcagcaagt 6060 

atctcagcgt ctgaatcggc atcaacgagt gcgtccgctt cagcaagtac tagcgcctca 6120 

gcctcagcgt caacaagtgc atcggcttca gcgtcaacga gtgcgtctga atcggcatca 6180 

acgagtgcgt ccgcttcagc aagtactagc gcctcagcct cagcgtcaac aagtgcatcg 6240 

gcttcagcat caacgagtgc gtccgcttca gcaagtacta gcgcctcagc ctcagcgtca 6300 

acaagtgcat cggcttcagc gtcaacgagt gcgtctgagt cagcatcaac gagtgcgtca 6360 

gcctcagcaa gcacatcagc ttctgaatct gcatcaacca gtgcgtcagc ctcagcatcg 6420 

acaagcgcct cagcttcagc aagtaccagt gcgtcagcct cagcgtcgac aagtgcgtcg 6480 

gcttcagcaa gtaccagtgc gtcagcctca gcaagtacca gtgcgtcagc ctcagcgtcg 6540 

acaagtgcgt cggcctcaac cagtgcatct gaatcggcat caaccagtgc gtcagcctca 6600

gcaagtacta gtgcatcagc ttcagcatca acgagtgcat cggcttcagc atcaaccagt 6660 

gcatcagagt cagcaagtac cagtgcgtca gcttccgcat caacaagtgc ctcggcttca 6720 

gcaagtacta gcgcctcagc ctcagcgtca acaagtgctt cagcttccgc gtcaaccagc 6780 

gcctcggcct cagcaagtat ctcagcgtct gaatcggcat caacaagtgc ctcggcttca 6840 

gcatcaacga gtgcatcagt ctcagcaagc accagtgcgt cggcctcagc aagcaccagc 6900 

gcgtctgaat ccgcatcaac cagtgcctca gcttcagcaa gtacctcagc atctgaatca 6960 

gcatcaacaa gtgcatcggc ttcagcaagc acaagtgctt cagcctcagc aagtatctca 7020 

gcgtctgaat cggcatcaac gagtgcgtcc gcttcagcaa gtactagcgc ctcagcatca 7080 

gcgtcaacaa gtgcttcggc ttcagcgtca acgagtgcgt ctgagtcagc atcaacgagt 7140 

acgtcagcct cagcaagcac atcagcttct gaatctgcat caaccagtgc gtcagcctca 7200 

gcatcgacaa gcgcctcagc ttcagcaagt accagtgcgt cagcctcagc aagtaccagt 7260 

gcttcagcct cagcgtcgac aagtgcgtcg gcctcaacca gtgcatctga atcggcatca 7320 

accagtgcgt cagcctcagc aagtactagc gcctcagcct cagcatcaac gagtgcgtcc 7380 

gcttcagcaa gtactagtgc atcagcatca gcatcaacga gtgcatcggc ttcagcaagt 7440 

accagcgcct cagcttcagc aagcaccagt gcgtcagcct cagcaagtac cagcgcctca 7500

gcctcagcaa gcaccagtgc ctcagcttca gcaagtacca gtgcgtcagc ctcagcgtcg 7560 

acaagtgcgt cggcttcagc aagtacctca gcgtctgaat cagcatcaac gagtgcatca 7620 

gcttcagcat caacaagtgc ttcagcttca gcaagtacca gtgcgtcggc ttcagcatca 7680 

acgagtgctt cagtctcagc gtcaaccagt gcctctgaat cagcatcaac aagtgcctcg 7740 

gcttcagcaa gcaccagtgc gtcggcttca gcaagtacta gtgcatcggc ttcagcatcg 7800 

acaagtgcgt ctgaatcggc atcaacgagt gcttcggctt cagcatcaac gagtgcgtca 7860

gcctcagcaa gcacatcagc ttctgaatct gcatcaacca gtgcgtccgc ttcagcgtca 7920 

accagtgcgt cggcttcagc gtcgacaagt gcttcggctt cagcatcaac gagtgcgtcg 7980 

gcctcagcaa gcgcaagtac ctcagcgtca gcttccgcct caaccagtgc gtccgcttca 8040 

gcaagcacaa gtgcgtcagc ctcagcaagt atctcagcgt ctgaatcggc atcaacgagt 8100 

gcgtcggcct cagcaagcgc aagtacctca gcgtcagctt ccgcctcaac cagtgcgtcg 8160 

gcttcagcaa gcacaagtgc gtcagcctca gcaagtatct cagcgtctga atcggcatca 8220 

acgagtgcgt ctgagtcagc atcaacgagt acgtcagcct cagcaagcac atcagcttct 8280 

gaatcggcat caaccagtgc gtcagcctca gcatcgacaa gcgcctcagc ttcagcaagt 8340 

accagtgctt cagcctcagc gtcgacaagt gcgtcggcct caaccagtgc atctgaatcg 8400 

gcatcaacca gtgcgtcagc ctcagcaagt actagtgcat cagcttcagc atcaacgagt 8460 

gcatcggctt cagcatcaac cagtgcctcg gcttcagcgt caaccagtgc gtcagcttca 8520 

gcaagtacca gtgcttcagt ctcagcatca acaagtgctt cagcctcagc atcgacaagt 8580 

gcctcggctt cagcaagcac atcagcatct gaatcagcgt cgacaagtgc gtcggcctca 8640 

accagtgcat ctgaatcggc atcaaccagt gcgtcagcct cagcaagtac tagtgcatca 8700 

gcttcagcat caacgagtgc atcggcttcg gcgtcaacca gtgcatcaga gtcagcaagt 8760 

accagtgcgt cagcttccgc atcaacaagt gcctcggctt cagcaagcac atcagcatct 8820 

gaatcagcgt caaccagtgc ttcggcttca gcaagtacca gtgcttcagc ttcagcatca 8880

accagcgcct cggcctcagc aagcacctca gcttctgaat cggcctcaac cagcgcctcg 8940 

gcctcagcaa gcacctcagc ttctgaatcg gcctcaacca gcgcctcagc ctcagcatca 9000 
acgagtgctt cggcttcagc aagcacaagc gcctcgggtt cagcatcaac gagtacgtca 9060

gcttcagcgt caaccagtgc ttcagcctca gcatcaacaa gtgcgtcagc ctcagcaagt 9120

atctcagcgt ctgaatcggc atcaacgagt gcgtctgagt cagcatcaac gagtacgtca 9180

gcctcagcaa gcacctcagc ttctgaatcg gcctcaacca gtgcgtcagc ctcagcatcg 9240

acaagcgcct cagcttcagc aagtaccagt gcttcagcct cagcgtcgac aagtgcgtcg 9300

gcctcaacca gtgcatctga atcggcatca accagtgcgt cagcctcagc aagtactagt 9360

gcatcggctt cagcatcaac cagtgcctcg gcttcagcgt caaccagtgc gtcagcttca 9420

gcaagtacca gtgcttcagt ctcagcatca acaagtgctt cagcctcagc atcgacaagt 9480

gcctcggctt cagcaagcac atcagcatct gaatcagcgt cgacaagcgc ctcagcttca 9540

gcaagtacca gtgcgtcagc ctcagcgtcg acaagtgcgt cagcctcagc aagtactagt 9600

gcatcagctt cagcatcaac gagtgcatcg gcttcggcgt caaccagtgc atcagagtca 9660

gcaagtacca gtgcgtcagc ttccgcatca acaagtgcct cggcttcagc aagcaccagt 9720

gcgtcggctt cagcaagtac tagcgcctca gcctcagcct caaccagtgc gtcagcctca 9780

gcaagtatct cagcgtctga atcggcatca acgagtgcgt ccgcttcagc aagtactagc 9840

gcctcagcct cagcgtcaac aagtgcatcg gcttcagcgt caacgagtgc gtctgaatcg 9900

gcatcaacga gtgcgtccgc ttcagcaagt actagcgcct cagcctcagc gtcaacaagt 9960

gcatcggctt cagcatcaac gagtgcgtcc gcttcagcaa gtactagcgc ctcagcctca 10020

gcgtcaacaa gtgcatcggc ttcagcgtca acgagtgcgt ctgagtcagc atcaacgagt 10080

gcgtcagcct cagcaagcac atcagcttct gaatctgcat caaccagtgc gtcagcctca 10140

gcatcgacaa gcgcctcagc ttcagcaagt accagtgcgt cagcctcagc gtcgacaagt 10200

gcgtcggctt cagcaagtac cagtgcgtca gcctcagcaa gtaccagtgc gtcagcctca 10260

gcgtcgacaa gtgcgtcggc ctcaaccagt gcatctgaat cggcatcaac cagtgcgtca 10320

gcctcagcaa gtactagtgc atcagcttca gcatcaacga gtgcatcggc ttcagcatca 10380

accagtgcat cagagtcagc aagtaccagt gcgtcagctt ccgcatcaac aagtgcctcg 10440

gcttcagcaa gtactagcgc ctcagcctca gcgtcaacaa gtgcttcagc ttccgcgtca 10500

accagcgcct cggcctcagc aagtatctca gcgtctgaat cggcatcaac aagtgcctcg 10560

gcttcagcat caacgagtgc atcagtctca gcaagcacca gtgcgtcggc ctcagcaagc 10620

accagcgcgt ctgaatccgc atcaaccagt gcctcagctt cagcaagtac ctcagcatct 10680

gaatcagcat caacaagtgc ctcggcttca gcaagcacaa gtgcttcagc ctcagcaagt 10740

atctcagcgt ctgaatcggc atcaacgagt gcgtccgctt cagcaagtac tagcgcctca 10800

gcatcagcgt caacaagtgc ttcggcttca gcgtcaacga gtgcgtctga gtcagcatca 10860

acgagtacgt cagcctcagc aagcacatca gcttctgaat ctgcatcaac cagtgcgtca 10920

gcctcagcat cgacaagcgc ctcagcttca gcaagtacca gtgcgtcagc ctcagcaagt 10980

accagtgctt cagcctcagc gtcgacaagt gcgtcggcct caaccagtgc atctgaatcg 11040

gcatcaacca gtgcgtcagc ctcagcaagt actagcgcct cagcctcagc atcaacgagt 11100

gcgtccgctt cagcaagtac tagtgcatca gcttcagcaa gtactagcgc ctcagcctca 11160

gcgtcgacaa gcgcctcagc ttcagcaagt accagtgcgt cagcctcagc gtcgacaagt 11220

gcgtcggctt cagcaagtac ctcagcgtct gaatcagcat caacaagtgc gtcggcttca 11280

gcatcaacga gtgcatcagc ttcagcatca acaagtgctt cagcttcagc aagtaccagt 11340

gcgtcggctt cagcatcaac gagtgcttca gtctcagcgt caaccagtgc ctctgaatcc 11400

gcatcaacaa gtgcctcggc ttcagcaagc accagtgctt cggcttcagc gtcaacgagt 11460

gcgtctgagt cagcatcaac gagtgcgtca gcctcagcaa gcacatcagc ttctgaatct 11520

gcatcaacca gtgcgtcagc ttccgcatca acaagcgcct cggcctcagc aagtacaagt 11580

gcttcagcct cagcatcaac cagtgcatca gcttcagcct caacaagtgc ttcagcctca 11640

gcgtcaacca gtgcctcggc ttcagcaagt
gcgtcagctt cagcatcaac cagtgcttcg
gcatcaacga gtgcgtcagc ctcagcaagt
gcatcagcct cagcaagtat ctcagcgtct
gcatcaacga gtgcatcggc ttcagcgtca
gcgtcggctt cagcatcaac gagtgcctca
gcatcaacga gtgcgtcagc ctcagcaagt
gcgtcggctt cagcatcaac cagtgcctca
gcatcaacga gtgcgtcagc ctcagcaagt
gcatcggctt cagcaagtac cagcgcctca
gcaagtacca gcgcctcagc ctcagcaagc
gcgtcagcct cagcgtcgac aagtgcgtcg
gcatcaacga gtgcatcagc ttcagcatca
gcgtctgaat cggcatcaac gagtgcgtcc
gcgtcaacaa gtgcttcggc ttcagcgtca
acgtcagcct cagcaagcac atcagcttct
gcatcgacaa gcgcctcagc ttcagcaagt
gcttcagcct cagcgtcgac aagtgcgtcg
accagtgcgt cagcctcagc aagtactagc
gcttcagcaa gtactagtgc atcagcttca
acaagcgcct cagcttcagc aagtaccagt
gcttcagcaa gtacctcagc gtctgaatca
acgagtgcat cagcttcagc atcaacaagt
gcttcagcat caacgagtgc ttcagtctca

accagtgcgt cagcttcagc aagcacaagt 11700 

gcttcggcat caacaagtgc ctcagcatca 11760 

actagtgcat cagcatcagc atcaaccagt 11820 

gaatcggcat caacgagtgc atcagcatca 11880 

accagtgcat cagtctcagc aagcaccagt 11940 

gcctcagcaa gtatctcagc gtctgaatcg 12000 

actagtgcat cggcttcagc aagcaccagt 12060 

gcctcagcsia gtatctcagc gtctgaatcg 12120 

actagtgcat cagcatcagc atcaacgagt 12180 

gcttcagcaa gcaccagtgc gtcagcctca 12240 

accagtgcct cagcttcagc aagtaccagt 12300 

gcttcagcaa gtacctcagc gtctgaatca 12360 

acaagtgctt cagcttcagc aagtatctca 12420 

gcttcagcaa gtactagcgc ctcagcatca 12480 

acgagtgcgt ctgagtcagc atcaacgagt 12540 

gaatctgcat caaccagtgc gtcagcctca 12600 

accagtgcgt cagcctcagc aagtaccagt 12660 

gcctcaacca gtgcatctga atcggcatca 12720 

gcctcagcct cagcatcaac gagtgcgtcc 12780 

gcaagtacta gcgcctcagc ctcagcgtcg 12840 

gcgtcagcct cagcgtcgac atagtgcgtcg 12900 

gcatcaacaa gtgcgtcggc ttcagcatca 12960 

gcttcagctt cagcaagtac cagtgcgtcg 13020 

gcgtcaacca gtgcctctga atccgcatca 13080 
acaagtgcct <2ggcttcagc aagcaccagt gcttcggctt cagcgtcaiac gagtgcgtct 13140 

gagtcagcat caacgagtgc gtcagcctca gcaagcacat cagcttctga atctgcatca 13200 

accagtgcgt cagcttccgc atcaacaagc gcctcggcct cagcaagtac aagtgcttca 13260 

gcctcagcat caaccagtgc atcagcttca gcctcaacaa gtgcttcagc ctcagcgtca 13320 

accagtgcct cggcttcagc aagtaccagt gcgtcagctt cagcaagcac aagtgcgtca 13380 

gcttcagcat caaccagtgc ttcggcttcg gcatcaacaa gtgcctcagc atcagcatca 13440 

acgagtgcc^ cagcctcagc aagtactagt gcatcagcat cagcatcaac cagtgcatca 13500 

gcctcagcaa gtatctcagc gtctgeuitcg gcatcaacga gtgcatcagc atcagcatca 13560 

acgagtgcat cggcttcagc gtcaaccagt gcatcagtct cagcaagcac cagtgcgtcg 13620 

gcttcagcat caacgagtgc ctcagcctca gcaagtatct cagcgtctga atcggcatca 13680 

acgagtgcgt cagcctcagc aagtactagt gcatcggctt cagcaagcac cagtgcgtcg 13740 

gcttcagcat caaccagtgc ctcagcctca gcaagtatct cagcgtctga atcggcatca 13800 

acgagtgcgt cagcctcagc aagtactagt gcatcagcct cagcatcaac gagtgcatcg 13860 

gcttcagcaa gtaccagcgc ctcagcttca gcaagcacca gtgcgtcagc ctcagcaagt 13920 

accagcgcct cagcctcagc aagcaccagt gcctcagctt cagcaagtac cagtgcgtca 13980 

gcctcagcat caacaagtgc ttcagcttcg gcctcaacaa gtgcgtcagc ttcagcatca 14040 

acgagtgcgt cggcttcagc aagcaccagt gcctcggcct cagcaagcac cagtgcttca 14100 

gcttcagcat caacaagtgc gtcagcttca gcaagtacat cagtttcaaa ttcagcaaac 14160 

cattcgaact cacaagttgg aaatacttct ggatcgacag gtaaatccca aaaagsiattg 14220 

cctaatacag gtactgagtc gtcaattgga tctgtgttac ttggagttct agcagctgtt 14280 

acaggtattg gattggttgc gaaacgccgt aaacgtgatg siagaagag 14328 

SeqID 74 

atgtcaaacg aaaaaaacac aaacactaac gtagaaaaga aagatgctac tgttgtagct 60 

cacgaaatca aaggggaact tacttacgaa gataaagtta tccaaaaaat cattggtctt 120 

tcactagaaa acgtttcagg tcttttggga atcgatggtg gtttcttctc aaatcttaaa 180 

gaaaaaatcg ttaacagcga tgacgtaaca agtggtgtta acgtagaagt tggtaaaaca 240 

caagttgcag ttgacttaaa cgttattgtt gagtaccaaa aaaatgttcc agctttatat 300 

tcagaaatca gagaaatcgt atcttcagaa gttgctaaaa tgactgactt ggaaattgtt 360 

gaaatceiacg taaacgttgt cgacatcaaa actaaagaac agcatgaagc agactcagta 420 

agccttcaag atcgcgtatc tgacgttgct gaatcaacag gagaattcac ttcagaacaa 480 

ttcgaaaaag gtaaatctgg tcttggatct ggtttctcaa ctgttcaaga aaaagttagc 540 

gaaggtgtag aagctgttaa aggtgcagca aatggtgtag tatctcacga aaacactcgt 600 

gtaaac 606 



SeqID 75 

atgacaaaag aaaaaaatgt aattttgact gctcgcgata ttgtcgtgga atttgacgtt 60 

cgtgacaaag tattgacagc cattcgcggc gtttcgcttg aactagtcga aggagaagta 120 

ttagccttgg taggtgagtc aggatcaggt aaatctgttt tgacaaagac cttcacaggt 180 

atgctcgaag aaaatggtcg tattgcccaa ggtagtattg actaccgtgg tcaggacttg 240 

acagctttat cttctcacaa ggattgggaa caaattcgtg gtgctaagat tgcgactatc 300 

ttccaggacc caatgactag tttggacccc attaaaacaa ttggtagtca gattacagaa 360 

gttattgtaa aacaccaagg aaaaacagct aaagaagcga aagaattggc cattgactac 420 

atgaataagg ttggcattcc agacgcagat agacgtttta atgaataccc attccaatat 480 

tctggaggaa tgcgtcaacg tatcgttatt gctattgccc ttgcctgccg acctgatgtc 540 

ttgatctgtg atgagccaac aactgccttg gatgtaacta ttcaagctca gattattgat 600 

ttgcta^iaat ctttacaaaa cgagtatcat ttcacaacaa tctttattac ccELcgacctt 660 

ggtgtggtgg caagtattgc ggataaggta gcggttatgt atgcaggaga aatcgttgag 720 

tatggaacgg ttgaggaagt cttctatgac cctcgccatc catatacatg gagtctcttg 780 

tctagcttgc ctcagcttgc tgatgataaa ggggatcttt actcaatccc aggaacacct 840 

ccgtcacttt atactgacct gaaaggggat gcttttgcct tgcgttctga ctacgcaatg 900 

cagattgact tcgaaccuiaa agctcctcaa ttctcagtat cagagacaca ttgggctaaa 960 

acttggcttc ttcatgagga tgctccgciaa gtagaaaaac cagctgtgat tgcfuiatctc 1020 

catgataaga tccgtgaaaa aatgggattt gcccatctgg ctgac 1065 

SeqID 76 

atgaaaaaaa atcgtgtatt tgctacagca ggtcttgttt tattagcagc aggtgtactt 60 

gcagcatgca gttcttcaaa atcatctgat tcatcagccc ctaaagctta tggctatgtt 120 

tatacagcag acccagaaac cttggactac ctgatttcaa gtaaaaatag tacaacagta 180 

gtgacttcaa atgggattga tggtttattc actaacgata attacggtaa tcttgctcct 240 

gcagttgcag aggattggga agtctctaag gatggtttga cctacactta taagattcgt 300 

aaaggggtta aatggtttac ctctgatgga gaagaatatg cagaggtgac ggctaaagat 360 

ttcgtgaacg gtttaaaaca cgcagcagat aaaaaatcag aagctatgta tttagctgsia 420 

aattcggtta aaggcttggc agattatcta tcaggaactt caacagattt ttcaacagtt 480 

ggtgtcaagg cggttgatga ttatacgtta caatacactt tgaaccagcc tgaaccgttc 540 

tggaactcta agttgaccta ttctattttc tggcctctga atgaagaatt cgaaacatca 600 

aaaggaagcg attttgctaa accaacagat ccgacatcct tgctttataa tggtccattc 660 

ttgttgaaag ggttgactgc aaaatcttct gtagagtttg taaaaaatga gcaatattgg 720 

gataaagaaa atgtccacct agatactatc aatctagctt actatgatgg atcagatcag 780 

gagtcgctag agcgtaactt cactagtgga gcttatagtt atgcccgtct ttaccctacc 840 

agctccaact attctaaggt tgcagaagaa tacaaggaca atatctatta cacacaatca 900 

ggctctggga ttgctggtct gggtgtgaat attgatcgcc aaagttacaa ctatacttct 960 

aaaactacag attcagagaa agtagctact aagaaggcat tgcttaacaa agatttccgt 1020 

caagccttga attttgctct tgatcgctca gcttactcag ctcaaatcaa tggtaaagat 1080 

ggagcagctt tagcagttcg taatttattt gtaaaaccag actttgtttc agctggtgag 1140 

aagacctttg gtgatttagt cgctgctcaa cttcctgctt atggtgatga gtggaaaggt 1200 

gtgaatttag ctgatgggca ggatggttta ttcaatgctg acaaggccaa ggcagagttt 1260 

gcgaaagcta agaaagcttt agaagcagac ggcgttcagt ttcctattca tctggacgtt 1320 

ccagtagacc aagcatcaaa aaactacata tctcgtattc agtcctttaa acaatctgta 1380 

gaaacagttc ttggtgttga aaatgtcgtt gttgatattc aacaaatgac aagtgatgaa 1440 

ttccttaata ttacttacta tgctgccaat gcttcatctg aggattggga tgtatcagga 1500 

ggagtttcat gggggccaga ctatcaagac ccatctactt acctggatat tttaaaaaca 1560 

actagcagtg aaactacaaa aacatattta ggatttgata atccaaatag cccttcagta 1620 

gttcaagttg gtttgaaaga atacgataaa ttagttgatg aagctgccag agagacaagc 1680 

gacttgaatg tccgttatga aaaatatgca gcggctcaag catggttgac agatagttca 1740 

ctctttattc ctgctatggc ttcttctggt gcagcaccag tgctttcacg aattgttcca 1800 

tttactggag cttctgcgca aacaggctct aaggggtcag atgtttactt caaatatttg 1860 

aaatcacaag ataaagtggt gactaaggaa gagtatgaaa aagctcgtga aaaatggttg 1920 

aaagaaaaag ctgaatcaaa tgagaaagct caaaaagaat tggcaagtca tgtgaag 1977 

SeqID 77 

atggaaatta atgtgagtaa attaagaaca gatttgcctc aagtcggcgt gcaaccatat 60 

aggcaagtac acgcacactc aactgggaat ccgcattcaa ccgtacagaa tgaagcggat 120 

tatcactggc ggaaagaccc agaattaggt tttttctcgc acattgttgg gaacggttgc 180 

atcatgcagg taggacctgt tgataatggt gcctgggacg ttgggggcgg ttggaatgct 240 

gagacctatg cagcggttga actgattgaa agccattcaa ccaaagaaga gttcatgacg 300 

gactaccgcc tttatatcga actcttacgc aatctagcag atgaagcagg tttgccgaaa 360 

acgcttgata cagggagttt agctggaatt aaaacgcacg agtattgcac gaataaccaa 420 

ccaaacaacc actcagacca cgttgaccct tatccatatc ttgctaaatg gggcattagc 480 

cgtgagcagt ttaagcatga tattgagaac ggcttgacga ttgaaacagg ctggcagaag 540 

aatgacactg gctactggta cgtacattca gacggctctt atccaaaaga caagtttgag 600 

aaaatcaatg gcacttggta ctactttgac agttcaggct atatgcttgc agaccgctgg 660 

aggaagcaca cagacggcaa ctggtactgg ttcgacaact caggcgaaat ggctacaggc 720 

tggaagaaaa tcgctgataa gtggtactat ttcaacgaag aaggtgccat gaagacaggc 780 

tgggtcaagt acaaggacac ttggtactac ttagacgcta aagaaggcgc caUggtatca 840 

aatgccttta tccagtcagc ggacggaaca ggctggtact acctcaaacc agacggaaca 900 

ctiggcagacsa agccagaatt cacagtagag ccagatggct tgattacagt aaaa 954 

SeqID 78 

atgaaasiaaa Mtattggac tttagcgata ttattctttt gtttgttcaa taattctgtt 60 

actgctcaag aaatacctaa aaatcttgat ggcaatataa ctcacactca gactagcgaa 120 

agtttttctg aatctgatga aaaacaggtt gactattcta ataaastatca agaagaagta 180 

gaccaaaata aatttcgtat tcaaatcgat aagacagaat tatttgtaac aacagataaa 240 

catttagaaa aaaactgfctg taaattggaa cttgaaccac aaataaataa cgatattgtt 300 

aactctgaaa gtaataattt actaggcgaa gataatttag ataataaaat taaggaaaat 360 

gtttctcatc tagataatag aggaggaaat atagagcatg acaaagataa cttagaatcg 420 

tcgattgtaa gaaaatatga atgggatata gataaagtta ctggtggagg cgaaagttat 480 

aaattatatt ctaaaagtaa ttctaaagtt tcaattgcta ttttagattc aggagtcgat 540 

ttacaaaata ctggattact gaaaaatctt tcaaatcaet caaaaaacta tgtccccaat 600 

aaaggatatt taggaaaaga ggagggagag gaaggaa^aa tatcagatat tcaagataga 660 

ttaggticatg gtacggctgt tgtagctcaa attgtagggg atgacaatat taatggagta 720 

aatcctcacg ttaatattaa cgtctataga atatttggta agtcgtcagc tagtccagat 780 

tggattgtaa aagcaatttt tgatgctgta gatgatggca atgatattat caatcttagt 840 

actggacaat atttaatgat tgatggagaa tatgaggacg gaacaaatga ttttgaaaca 900 

tttttgaagt ataaaaaggc tattgattac gcgaatcciaa aaggagtaat tatagtagct 960 

gcattaggga atgactccct aaatgtatca aatcagtcag atttattgaa acttattagt 1020 

tcacgcaaaa aagtaagaaa accaggatta gtagttgatg ttccaagtta tttctcatct 1080 

acaatttcgg tcggaggcat agatcgctta ggtaatttat cagattttag caataaaggg 1140 

gattictgatg caatatatgc gcctgcaggc tcaacattat ctctttcaga attaggactt 1200 

aat:aaGttt:a titiaatgcaga aaaatataaa gaagat.tgga ttttttcggc aacactagga 1260 

ggatatacgt atctttatgg aaactcattt gctgctccta aagtttctgg tgcgattgca 1320 

abgattattg ataaatacaa attaaaagat cagccctata attatatgtt tgtaaaaaaa 1380 

tcctggaaga aacattacca g 1401 

SeqID 79 

atgaaaaaag atgagttatt tgaaggcttt tacctaatca aatcagctga cctgaggcaa 60 
actcgagctg ggaaaaacta cctagccttt accttccaag atgatagtgg cgagattgat 120 
gggaagctct gggatgccca acctcataac attgaggcct ttaccgcagg taaggttgtc 180 
cacatgaaag gacgccgaga agtttataac aatacccctc aagtcaatca aattactctc 240 
cgcctgcctc aagctggtga acccaatgac ccagctgatt tcaaggtcaa gtcaccagtt 300 
gatgtcaagg aaattcgtga ctacatgtcg caaatgattt tcasiaattga aaatcctgtc 360 
tggcaacgga ttgtccgaaa tctcfcacacc aagtatgata aggaattcta ctcctatcca 420 
gctgccaaga ccaaccacca tgcctttgaa acgggcttgg cctatcatac ggcgaccatg 480 
gtgcgtttgg cagacgctat tagcgaagtt tatcctcagc tcaataagag cctgctctat 540 
gcggggatta tgttgcatga cttagctaag gtcatcgagt tgacggggcc agaccagaca 600 
gagtacacag tgcgaggtaa tcttcttgga catatcgctc tcattgatag cgaaattacc 660 
aagacagtta tggaacecgg catcgatgat accaaggaag aagtcgtttt gcttcgtcat 720 
gtcatcctca gtcaccacgg cttgcttgag tatggaagcc cagtccgtcc acgcattatg 780 
gaagcagaga ttatccatat gattgacaat ctggatgcaa gcatgatgat gatgtcaaca 840 
gctcttgctt tggtggataa aggagagatg accaataaaa tcttcgctat ggataatcgt 900 
tccttctata aaccagattt agat 



SeqID 80 

gtgacgattc taggaaaaga tacagttcaa caatctgcga aaggtgaatc tgtaactcaa 60 

gaagctaoac cagagtataa gctagaaaat acaccaggtg gagataaggg aggcaatact 120 

ggaagctcag atgctaatgc gaatgaaggc ggtggtagcc aggcgggtgg atcagctcac 180 

acaggttcac aaaactcagc tcaatcacaa gcttctaagc aattagctac tgaaaaagaa 240 

tcagctaaaa atgccattga aaaagcagcc aaggacaagc aggatgaaat caaaggcgca 300 

ccgctttctg ataaagasiaa agcagaactt ttagcaagag tggaagcaga aaaacaagca 360 

gctctcaaag agattgaaaa tgcgaaaact atggaagatg tgaaggaagc agaaacgatt 420 

ggagtgcaag ccattgccat ggttacagtt cctaagagac cagtggctcc taatgctgct 480 

cctaagacaa caagtgcacc gcaagcaact gcaggaacaa tgcaagatgt cacctaccag 540 

tcacctgctg gcaaacaatt acctciacaca ggttcagcat caagtgcagc acttgctagt 600 

cttggtctag tggtggcaac suigtggtttt gctttgctag gaagaaagac tagacgtaga 660 

663 



SeqID 81 

atgaatgcag atgatacagt aaccatttat gatgtcgctc gtgaagcagg tgtttccatg 60 

gcgacggtca gccgtgtggt caatggcaat aaaaatgtaa aagagaatac ccgtaaaaaa 120 

gtgctagagg taattgatcg tttggattat cgtccaaatg cagttgcgcg tggtcttgca 180 

agtaaaaaga caaccactgt cggtgtcgtg attccaaata ttaccaatgg ttatttttcg 240 

agtttggcta aggggattga tgatattgca gaaatgtaca agtacaatat tgtcctagct 300 

aatagcgatg aagataacga gaaagaagtt tctgttgtca ataccctctt ttcaaagcag 360 

gtagatggca ttatctatat ggggtatcac ttgacagata aaattcgctc agaattttcg 420 

cgttcacgta ctccgattgt tctcgcagga actgtcgatg ttgagcacca gttgccaagt 480 

gtcaatattg actataagca agcaiacaatt gatgcagtga gttaccttgc taaagaaaat 540 

gagcgtattg ctttcgttag cggtccgcta gtggatgaca tcaatggtaa ggttcgttta 600 

gttggctaca aggaaacctt gaaaaaagca ggaatcactt atagtgaggg tttggtattt 660 

gaatctaaat atagctatga tgatggttac gccttagcag agcgtttgat ttcatcaaat 720 

gcaactgcag cagttgtgac aggtgatgag ttggcagcag gagtcttgaa cggtttggct 780 

gataagggtg tttctgtgcc agaagatttt gaaattatta ctagtgatga ttcacaaatc 840 

tcacgcttta cccgtccaaa cttgacaacg attgcccaac ctctttatga ecttggtgcc 900 

attagtatgc gtatgttgac caagattatg catziaggaag agttggaaga acgtgaagtt 960 

ctcttacctc atggtttgac agaacgtagc tcaacacgaa aacgtaeia 1008 



SeqID 82 

atgaaaaaaa agttagtatt tcctaatctg ttttggtggg gagctgcttc tagcggacct 60 

cagacagaag gtcaatatgg aaaagtacat gaaaatgtga tggactactg gttcaaaacg 120 

catccagaag attttttcga taatgtcgga cctcttgtag ccagtaactt ttttcatact 180 

tacaccgaag atttccactt gatgaaggaa attggagtta attctttccg cacttccatc 240 

caatggagtc gactcatcaa gaatttagag acaggtgagc ctgatccaaa aggtattgct 300 

ttctacaatg ccatcattga agaagctaaa aagaaccaga tggatcttgt gatgaattta 360 

catcattttg atttaccagt ggaacttctt caaaaatacg gtggttggga aagcaaacat 420 

gtagtggagt tattcgtgaa gtttgccaag actgctttca catgctttgg agataaggtt 480 

cattactgga caactttcaa tgagccaatg gtcattccag aagcaggata cttatatgct 540 

ttccattatc caciatctaaa aggaaaggga aaagaggccg tacaagtcat ctataatcta 600 

aaccttgcta gtgcaaaagt gattcaacta tatcgctcat tagaacttga tggaaagatt 660 

gggattattt taaacttgac acctgcttat ccaagaagta attctccaga agacttagaa 720 

gcaagtcgat ttacagatga cttctttaac aaagtcttct tgaatccagc tgttaaagga 780 

actttcccag aaagattggt aaaacagcta gagagagatg gcgtgttatg gagtcatacc 840 

gaaaaagagc ttcaactgat gaaatcsiaat acggttgatt ttcttggagt aaactactac 900 

catccaaaac gtgttcaagc acaagcaaat cctgaggaat atcagacgcc ctggatgcca 960 

gaccaatact tcaaagagta tgaatggctg gagcgtcgca tgaatccata tcgtggttgg 1020 

gaaatttttc cgaaagccat ttatgatatt gctatgattg tgaaggaaga atatggtaat 1080 
atcccatggt ttatcagtga aaacggaatg ggtgttgaaa acgaagcacg gtttatcgat 1140 

gaaaatggag ttatcgatga cgtgtatcgt attgaatttt atgaagaaca ttteiagatgg 1200 

ctacataaag ccattgaaga gggaagtcac tgttttggat accacgcttg gaccgcattt 1260 

gattgctggt cttggaataa tgcatataag aatcgttacg gatttatctc cgttgattta 1320 

gaaacgcaaa agagaaccat caagagctca ggaagatggt atcgcaaagt aagtgacaat 1380 

aacggttittg aagtagaaat tgaggag 1407 

SeqID 83 

gtggaaaatc ttacgaattt ttacgaaaag tatcgtgtct atctgactcg tccacglitta 60 

gagcttttgg cagtagttac cattgttttc tgtgctgtac tcgtcttttt tctaaatatt 120 

ccaggaaaag gtgtcttaaa actcgataat ggaacgattg tttatgatgg cagtcttgtc 180 

cgcggtaaaa tgaaeggcca aggtaccatt accttccaaa atggagacca atatacaggt 240 

ggcttcaaca atggagcctt caacggaaaa ggtacctttc aatctaaaga aggctggacc 300 

tacgaaggtg attttgtaaa tggtcaggct gaaggaaaag ggaaactaac aacagaacaa 360 

gaagtcgttt atgaaggaac ttttaaacaa ggcgtttttc aacaaacia 408 

SeqID 84 

atgttgaata agataagaga ctatttagac tttgctggtt tgcagtaccg taatcctgat 60 

euiagcgggag cagagcgaga gaagatgctg gcattccgcc acaaaggaca agaggcccga 120 

aaggttetta cagaactggc caaagccttt caagcaagcc atccagaatg gcaactccaa 180 

cagactagcc agtggatgaa tcaggcccag cgtttgagac cacatttttg ggtttatcta 240 

cagagagacg gacaagtgac agaacctatg atggccttac gtttgtatgg gacatctact 300 

gactttggaa tttctttgga agtcagtttc atcgaacgta agaaggatga gcaaacactg 360 

ggcaagcagg ccaaagtttt agacattcca accgttaaag ggatttatta tctaacctac 420 

tctaatggtc aaagtcaacg gtgggaggcg aatgaagaaa agcgtcgtac tttacgcgag 480 

aaggtgagaa gtcaagaagt tcgaaaagtt ttagtgaagg tagatgttcc tatgacagaa 540 

aattcgtctg aagaagaaat cgtagaaggc ttattgaagt cttattctaa aattcttccc 600 

tattatctag ctacgagaaa a 621 

SeqID 85 

atggttcaga acagttgttg gcaatcaouig agccataagg tcaaggcttt taccttgtta 60 

gaatccctgc ttgccctcat tgtcatcagt gggggattac tcctttttca agctatgagt 120 

cagctcctca tttcagaagt tcgctaccag caacaaagcg agcaaaagga gtggctcttg 180 

tittgtggacc aacttgaggt agaattagac cgttcgcagt tcgaaaaagt agaaggcaat 240 

cgcctataca tgaagcaaga tggcaaggac atcgccafccg gtaagtcaaa gtcagatgat 300 

ttccgtaaaa cgaatgctcg tggtcgaggt tatcagccta tggtttatgg actcaaatct 360 

g^acggatta cagaggacaa tcaactggtt cgctttcatt tccagttcca aaaaggctta 420 

gaaagggagt tcatctatcg tgtggaaaaa gaaaaaagt 459 

SeqID 86 

atgaaaaaaa tgatgacatt cttgaaaaaa gctaaggtta aagcttttac attggtggag 60 

atgttggtgg tcttgctgat tatcagcgtg cttttcttgc tctttgtacc taatctgacc 120 

aagcaaaaag aagcagtcaa tgacaaagga aaagcagctg ttgttaaggt ggtggaaagc 180 

caggcagaac tttatagctt agaaaagaat gaagatgcta gcctaagaaa gttacaagca 240 

gatggacgca tcacggaaga acaggctaaa gcttataaag aatacaatga taaaaatgga 300 

ggagcaaatc gtaaagtcaa tgat 324 

SeqID 87 

atgacatcaa aagttagaaa ggcagtcatc cctgctgctg gactaggaac tcgattttta 60 

ccagcaacca aggcccttgc caaagaaatg ttgccaatcg tagacaaacc aactatccag 120 

tttatcgtgg aagaagctct caaatcaggt attgaagata ttctagttgt cactggtaaa 180 

tcaaaacgtt ctattgagga ccactttgat tcaaacttcg aattggaata taacctcaaa 240 

gaaaaaggga aaacagatct tttgaagcta gttgataaeia caactgacat gcgtctgcat 300 

tttatccgcc aaactcatcc acgcggtctc ggagatgctg ttttgcaagc caaggctttc 360 

gtcggaaatg aaccttttgt cgttatgctt ggtgatgact tgatggatat cacagacgaa 420 

aaggctgttc cacttaccaa acaactcatg gatgactacg agcgtaccca cgcgtctact 480 

atcgctgtca tgccagtccc tcatgacgaa gtatctgctt acggggttat tgctccgcaa 540 

ggcgaaggaa aagatggtct ttacagtgtt gaaacctttg ttgaaaaacc agctccagag 600 

gacgctccta gcgaccttgc tattatcgga cgctacctcc tcacgcctga aatttttgag 660 

attctcgaaa agcaagctcc aggtgcagga aatgaaattc agctgacaga tgcaatcgac 720 

accctcaata aaacacaacg tgtatttgct cgtgagttca aaggggctcg ttacgatgtc 780 

ggagacaagt ttggcttcat gaaaacatcc atcgactacg ccctcaaaca cccacotagtc 840 

aaagatgatt tgaagaatta cctcatccaa cttggaaiaag aattgactga gaaggaa 897 

SeqID 88 

atgcaaaatc aattaaatga attaaaacga aaaatgctgg aatttttcca gcaaaaacaa 60 

aaaaataaaa aafecagctag acctggcaag aaaggttcaa gtaccaaaaa atctaaaacc 120 

ttagataagt cagtcatttt cccagctatt ttactgagta taaaagcctt atttaactta 180 
ctctttgtac tcggttttct aggaggaatg ttgggagctg ggattgcttt gggatacgga 240 

gtggccttat ttgacaaggt tcgggtgcct cagacagaag aattggtgaa tcaggtcaag 300 

gacatctctt ctatttcaga gattacctat tcggacggga cggtgattgc ttccatagag 360 

agtgatttgt tgcgcacttc tatct^catet gagcaaattt cggaaaatct gaagaaggct 420 

atcattgcga cagaagatga acactttaaa gaacataagg gtgtagtacc caaggcggtg 480 

attcgtgcga ccttggggaa atttgtaggt ttgggttcct ctagtggggg ttcsiaccttg 540 

acccagcaac taattaaaca gcaggtggtt ggggatgcgc cgaccttggc tegtaaggcg 600 

gcagagattg tggatgctct tgccttggaa cgcgccatga ataaagatga gattttaacg 660 

acctatctca atgtggctcc ctttggccga aataataagg gacagaatat tgcaggggct 720 

cggcaagcag ctgagggaat tttcggtgta gatgccagtc agttgactgt tcctcaagca 780 

gcatttttag caggacttcc acagagtccc attacttact ctccttatga aaatactggg 840 

gagttgaaga gtgatgaaga cctagaaatt ggcttaagac gggctaaggc agttctttac 900 

agtatgtatc gtacaggtgc attaagcaaa gacgagtatt ctcagtacaa ggattatgac 960 

cttaaacagg actttttacc atcgggcacg gttacaggaa tttcacgaga ctatttatac 1020 

tttacaactt tggcagaagc tcaagaacgt atgtatgact atctagctca gagagacaat 1080 

gtctccgcta aggagttgaa aaatgaggca actcagaagt tttatcgaga tttggcagcc 1140 

aaggaaattig aaaatggtgg ttataagatt actactacca tagatcagaa aattcattct 1200 

gccatgcaaa gtgcggttgc tgattatggc tatcttttag acgatggaac aggtcgtgta 1260 

gaagtaggga atgtcttgat ggataiaccaa acaggtgcta ttctaggctt tgtaggtggt 1320 

cgtaattatc aagaaaatca aaataatcat gcctttgata ccaaacgttc gccagcttct 1380 

actaccaagc ccttgctggc ctacggtatt gctattgacc agggcttgat gggaagtgaa 1440 

acgattctat ctaactatcc aacaaacttt gctaatggca atccgattat gtatgctaat 1500 

agcaagggaa caggaatgat gaccttggga gaagctctga actattcatg gaatatccct 1560 

gcttactgga cctatcgtat gctccgtgaa aagggtgttg atgtcaaggg ttatatggaa 1620 

aagatgggtt acgagattcc tgagtacggt attgagagct tgccaatggg tggtggtatt 1680 

gaagtcacag ttgcccagca taccaatggc tatcagacct tagctaataa tggagttitat 1740 

catcagaagc atgtgatttc aaagattgaa gcagcagatg gtagagtggt gtatgagtat 1800 

caggataaac cggttcaagt ctattcaaaa gctactgcga cgattatgca gggattgcta 1860 

cgagaagttc tatcctctcg tgtgacaaca accttcaagt ctaacctgac ttctttaaat 1920 

cctactctgg ctaatgcaga ttggattggg aagactggta caaccaacca agacgaaaat 1980 

atgtggctca tgctttcgac acctagatta accctaggtg gctggattgg gcatgatgat 2040 

aatcattcat tgtcacgtag agcaggttat tctaataact ctaattacat ggctcatctg 2100 

gtaaatgcga ttcagcaagc ttccccaagc atttggggga acgagcgctt tgctttagat 2160 

cctagtgtag tgaaatcgga agtcttgaaa tcaacaggtc aaaaaccaga gaaggtttct 2220 

gttgaaggaa aagaagtaga ggtcacaggt tcgactgtta ccagctattg ggctaataag 2280 

tcaggagcgc cagcgacaag ttatcgcttt gctattggcg gaagtgatgc ggattatcag 2340 

aatgcttggt ctagtattgt ggggagtcta ccaactccat ccagctccag cagttcaagt 2400 

agtagttcta gcgatagcag taactcaagt actacacgac cttcttcttc aagggcgaga 2460 

cga 2463 

SeqID 89 

atgtcaticta aatttatgaa gagcgctgcg gtgcttggaa ctgctacact tgctagcttg 60 

GUtetggtag cttgcggaag caaaactgct gataagcctg ctgattctgg ttcatctgaa 120 

gtcaaagaac tcactgtata tgtagacgag ggatataaga gctatattga agaggttgct 180 

aaagcttatg aaaaagaagc tggagtaaaa gtcactctta aaactggtga tgctctagga 240 

ggtcttgata aactttctct tgacaaccaa tctggtaatg tccctgatgt tatgatggct 300 

ccatacgacc gtgtaggtag ccttggttct gacggacaac tttcagaagt gaaattgagc 360 

gatggtgcta aaacagacga cacaactaaa tctcttgtaa cagctgctaa tggtaaagtt 420 

tacggtgctc ctgccgt.t.a1:. cgagtcactt gttatgtact acaacaaaga cttggtgaiaa 480 

gatgctccaa aaacatttgc tgacttggaa aaccttgcta aagatagcaa atacgcattc 540 

gctggtgaag atggrtaaaac tactgccttc ctagctgact ggacaaactt ctactataca 600 

tatggacttc ttgccggtaa cggtgcttac gtctttggcc aaaacggtaa agacgctaaa 660 

gacatcggtc ttgcaaacga cggttctatc gtaggtatca actacgctaa atcttggtac 720 

gaaaaatggc cta£iaggtat gcaagataca gaaggtgctg gaaacttsuit ccaaactceia 780 

ttccaagaag gtaaaacagc tgctatcatc gacggacctt ggaaagctca agcctttaaa 840 

gatgctaaag taaactacgg agttgcaact atcccaactc ttccaaatgg aaaagaatat 900 

gctgcattcg gtggtggtaa agcttgggtc attcctcaag ccgttaagaa ccttgaagct 960 

tctcaaaaat ttgtagactt ccttgttgca actgaacaac aaaaagtatt atatgataag 1020 

actaacgasia 'tcccagctaa tactgaggct cgttcatacg ctgeiaggtEia aaacgatgag 1080 

ttgacaacag ctgttatcaa acagttcaag aacactcaac cactgccaaa catctctcaa 1140 

atgtctgcag tttgggatcc agcgaaaaat atgctctttg atgctgtaag tggtcaaaaa 1200 

gaegctiaaaa cagctgctaa cgatgctgta acattgatca aagaaacaat caaacaaaaa 1260 

tttggtgaa 1269 

SeqID 90 

atgatagata ciagtggtcag gaacctactc ctgacctttt tcttttgcaa aatgacaaaa 60 

atcataattt ttttgacaac tatacttgtc aaaaagaaaa agatatgtta caatgaattc 120 

aagttaagaa ataggaagca gaaaggagtt ataatgtggg tactaggatt tatactattt 180 
atgattttct tttattctaa taattctaaa aaaatcaaga aactagagaa taaaatcaaa 240 
agacttgagc gaaaagagaa aggaaacgca gaaatgtcga gattattaca ^gaaatgatt 300 
SaaagiaL laattataac gggagtgtat attgggccag ataaetggga asttgt^at 360 
gitgatgagg aatgggtaaa gctacgacgt gtagataata cgggaaaaga aaaattcaag 420 
ttgoaaegta ctgaggatat ccaaaocgtt gaatttgaog gagag 



SeqID 91 

atgattttaa gtaaaaatag agaagatggg ttaagaaaat ttgcgactaa «tccgatta 60 

aatactctta gaacattgaa tcatcttgga ttcggacatt acggaggaag tct3«^atc 120 

gtagaagttt tagcggtgct ttatggtgaa ataatgccaa tgactccaga ^t^tttgca 180 

loalgagata gagattattt catattatca aaaggtcacg gaggacoagc t«=t*tacagt 240 

IcacLIatt ?gaatggttt ctttgacaaa gaattottat attctttaaa tacaaatgga 300 

accaaattac cgtctcatcc tgatagaaat ctaacgccag gcatagatat Sacaa^fff^ 360 

tctttaggao aaggaattag tgttgcaact ggacttgcat atggtcagag «*t^9ff^9 420 

agtccctttt atacttaogc tattgttgga gatggtgagt taaatgaggg aeaatgttgg 480 

glggctatac agtttgcttc tcatcaaeag ttatccaact taattgtatt tgttgat^ 540 
aacaaaaaac aattagatgg ttttaoaaag gatatttgta atccaggtga tttcst*f** 600 

aaattttcag catttggatt tgaatccatt agggtcaagg gttcagatat tagagaaatt 660 
tatgaaggga ttgtccaatt aaaacagtca aataattcat eacctaagtg cattgtatta 720 

gatactatta aaggtcaagg ggttcaagag otggaagaaa tgaaateeaa tcatcatctt 780 

cgccctaetg tagaggagaa acaaatgtta acttcagttg tagaaagatt aagtcaggaa 840 
titggaggaaa cagaa 

SeqID 92 

atgaaaaaaa ctacaatatt atcattaact acagctgcgg ttattttagc agcatatgtc 60 

cctaatgaac caatcctagc agatactcct agttcggaag taatcaaaga gactaaagtt 120 
ggaagtatta ttcaacaaaa taatatcaaa tataaggttc taactgtaga aggtaacata 180 
ggaactgttc aagtgggtaa tggagttact cctgtagagt ttgaagctgg tcaagatgga 240 
aaaccattca cgattcctac aaaaatcaea gtaggtgata aagtatttac cgttactgaa 300 

gtagctagtc aagcttttag ttattatcca gatgaaacag gtagaattgt ctactatcct 360 

agctctatta ctatcccatc aagcataaaa aaaatacaaa aaaaaggctt ccatggaagt 420 

aaagctaaaa ctattatttt tgacaaaggc agtcagctgg agaaaattga agatagagct 480 

tttgattttt ctgaattaga agagattgaa ttgcctgcat ctctagaata tattggaaca 540 
agtgcatttt cttttagtca aaaattgaaa aagctaacct tttcctcaag ttcaaaatta 600 
gaattaatat cacatgaggc ttttgctaat ttatcaaatt tagagaaact aacattacca 660 
aaatcggtta aaacattagg aagtaatcta tttagactca ctactagctt aaaacatgtt 720 
gatgttgaag aaggaaatga atcgtttgcc tcagttgatg gtgttttgtt ttcaaaagat 780 
aaaacccaat taatttatta tccaagtcaa aaaaatgacg aaagttataa aacgcctaag 840 
gagacaaaag aacttgcatc atattcgttt aataaaaatt cttacttgaa aaaactcgaa 900 
ttgaatgaag gtttagaaaa aatcggtact tttgcatttg cagatgcgat taaacttgaa 960 

gaaattagct taccaaatag tttagaaact attgaacgtt tagcctttta cggtaattta 1020 

gaattaaaag aacttatatt accagataat gttaaaaatt ttggtaaaca cgttatgaac 1080 

ggtttaccaa aattaaaaag tttaacaatt ggtaataata tcaactcatt gccgtccttc 1140 

ttcctaagtg gcgtcttaga ttcattaaag gaaattcata ttaagaataa aagtacagag 1200 

ttttctgtga aaaaagatac atttgcaatt cctgaaactg ttaagttcta tgtaacatca 1260 

gaacatataa aagatgttct taaatcaaat ttatctacta gtaatgatat cattgttgaa 1320 

aaagtagata atataaaaca agaaactgat gtagctaaac ctaaaaagaa ttctaatcag 1380 

ggagtagttg gttgggttaa agacaaaggt ttatggtatt acttaaacga atcaggttca 1440 

atggctactg gttgggttaa agacaaaggt ttatggtatt acttaaacga atcaggttca 1500 

atggctactg gttgggttaa agacaaaggc ttatggtact acttaaatga atcaggttca 1560 

atggctactg gttgggttaa agacaaaggc ttatggtatt acttaaacga atcaggttca 1620 

atggctactg gttgggttaa agacaaaggc ttatggtact acttaaatga atcaggttca 1680 

atggctactg gttgggttaa agacaaaggc ttatggtatt acttaaatga atcaggttca 1740 

atggctactg gttgggttac agtttctggt aaatggtact atacctataa ttcaggagat 1800 

ttattagtaa acacgactac acccgatggc tatcgagtca atgctaacgg tgagtgggta 1860 

gga 1863 

SeqID 93 

atggtaagat ttacaggact tagtctcaaa caaacgcaag ctattgaggt tttaaaaggt 60 
cacatttctc taccagatgt ggaagtggct gtcactcagt ctgaccaagc atctatctct 120 
atcgagggtg aggaaggtca ctatcaattg acctaccgca aacctcacca actttatcgt 180 
gccttgtcct tgttggtaac agttctagca gaagctgata aagtagagat tgaggaacaa 240 
gcagcttacg aagatttggc ttacatggtt gactgttctc gaaatgcggt gctgaatgtg 300 
gcttctgcca agcagatgat tgagatattg gctctcatgg gctactcaac ctttgagctt 360 
tacatggaag acacttacca gattgaaggg cagccttact ttggctattt ccgtggagct 420 
tattcagcag aggagttgca ggaaatcgaa gcctatgccc aacagtttga cgtgaccttt 480 
gtaccatgca tccagacctt ggcccacttg tcggcctttg tcaaatgggg tgtcaaggaa 540 
gtgcaggagc tccgtgatgt agaggacatt cttctcattg gcgaagaaaa ggtttatgac 600 



ttgattgatg gcatgtttgc cacgttgtct aaactgaaga ctcgcaaggt caatatcggg 660 
atggacgaag cccacttggt tggtttggga cgctacctga ttctgaacgg tgttgtggat 720 
cgtagtctcc tcatgtgcca acacttggag cgcgtgctgg atattgctga caaatatggt 780 
ttccactgcc agatgtggag tgatatgttc ttcaaactca tgtcagcgga tggccagtac 840 
gaccgtgatg tggaaattcc agaggaaiact cgtgtctacc tagaccgtct caaagaccgt 900 
gtgactctgg tttactggga ttattatcag gatagcgagg aaaaatacaa ccgtaatttc 960 

cgcaatcatc acaagattag ccatgacctt gcatttgcag ggggagcttg gaagtggatt 1020 

ggctttacac ctcacaacca ttttagccgt ctagtggcta tcgaggctaa taaagcctgc 1080 

cgtgccaatc agattaaaga agtcatcgta acgggttggg gagacaatgg tggtgaaact 1140 

gcccagttct ctatcctacc aagcttgcaa atctgggcag ELactcagcta tcgcaatgac 1200 

ctagatggtit tgtctgcgca cttcaagacc eiatactggtc taacggttga ggattttatg 1260 

cagattgacc ttgccaacct cttaccagac ctaccaggca atctcagcgg tatcaatccc 1320 

aaccgctatg ttttttatca ggatattctt tgtccgattc ttgalccaaca catgacacct 1380 

gaacaggaca aaccgcactt cgctcaggct gctgagacgc ttgctaacat taaagaaaaa 1440 

gctggaaact atgcctatct ctttgaaact caggcccagt tgaatgctat tttaagtagc 1500 

aaagtagatg tgggacgacg cattcgtcag gcctaccaag cggatgataa agaaagttta 1560 

caacaaatcg ccagacaaga attaccagaa cttagaagcc aaattgaaga cttccatgcc 1620 

ctctttagcc accaatggct gaaagaaaac aaggtctttg gtttggatac agttgacatc 1680 

cgtatgggcg gactcttgca acgcatcaaa cgagcagaaa gccgtatcga ggtttatctg 1740 

gctggtcagc ttgaccgcat cgacgagctg gaagttgaaa tcctaccatt tactgacttc 1800 

tacgcagaca aggatttcgc agcaactaca gccoiaccagt ggcataccat: tgcgacagcg 1860 

tcgacgattt atacgact 1878 

SeqID 94 

atgtctaatt catttgtcaa gttgttagtc tctcaattat ttgcaaattt agcagatatt 60 
ttctttagag taacaatcat tgctaacata tacattattt caaaatcagt aattgccaca 120 
tcactagttc ctatcttaat aggaatatcc tcttttgttg ogagtctttt agttccgttg 180 
gttactaaaa ggttagcgct aaatagggtt ttatctttat ctcaatttgg aaagactata 240 
ttattggcga tactggtagg aatgtttacc gtaatgcaat ccgtagcgcc tttggtgacc 300 
tatc1:attt:g ttgttgcaat tt:ccatact.a gatggttttg cagcacccgt ttcctatgct 360 
attgtgccac gctatgcgac cgatttgggt aaggctaatt cagccttatc aatgactggt 420 
gaagctgttc aattgatagg ttggggatta ggtggactct tgtttgcaiac aattggtctg 480 
ttacctacca cgtgtatcaa tttagtcttg tatatcattt ctagctttct gatgttattt 540
cttcctaacg ctgaagtgga ggtgttagag tcagaaacta atcttgaaat tttgctcaaa 600 
ggttggaagt tagttgctag aaatcctaga ttaagacttt ttgtatcagc aaatttattg 660 
gaaatttttt caaatacgat tt,gggtt.t:ct: tcca1:t.atac ttgtttttgt aacggagtta 720 
ttaaataaaa cggaaagtta ctggggatat tctaatacag catactctat tggtattata 780 
attagtggct taattgcttt taggctatct gaaaagttcc ttgctgctaa a 831

SeqID 95

atgtttgcat caiaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtgttgga 60 
gtagctagtg tagttgttgc cagtcttgtt atgggaagtg tggttcatgc gacagagaac 120 
gagggagcta cccaagt.acc cactiticttct aatagggcaa atgaaagtca ggcagaacaa 180 
ggagaacaac ctaouiaaact cgatticagaa cgagataagg caaggaaaga ggtcgaggaa 240 
tatgtaaaaa aeiatagtggg tgagagctat gcaaaatcaa ctaaaaagcg acatacaatt 300 
actgtagctc bagUtciacga gt:tgaacaac attaagaacg agtatttgaa taaaatagtt 360 
gaatcaacct cagaaagcca actacagata ctgatgatgg agagtcgatc aaaagtagat 420 
gaagctgtgt ctaagtttga aaaggactca tcttcttcgt caagttcaga ctcttccact 480 
aaaccggaag cbtcagatac agcgaagcca aacaagccga cagaaccagg agaaaaggta 540 
gcagaagcta agaagaaggft tgaagaagct gagaaaaaag ccaaggatca aaaagaagaa 600 
gatcgtcgta actacccaac cattacttac aaaacgcttg aacttgaiaat tgctgagtcc 660 
gatgtiggaag ttaiaactaagc ggagcttgaa ctagtaaaag tgaaagctaa cgaacctcga 720 
gacgagcaaa aaattaagca agcagaagcg gaagttgaga gtaaacaagc tgaggctaca 780 
aggttaaaaa aaatcaagac agatcgtgaa gaagcagaag aagaagctaa acgaagagca 840
gatgctaaag agcaaggtaa accaaagggg cgggcaaaac gaggagttcc tggagagcta 900 
gcaacacctg ataaaaaaga aaatgatgcg aagtcttcag attctagcgt aggtgaagaa 960 

actcttccaa gcccatccct gciaaccagaa aaaaaggtag cagaagctga gaagaaggtt 1020 

gaagaagcta agaaaaaagc cgaggatcsia aaagaagaag atcgccgtaa ctacccaacc 1080 

aatacttaca aaacgcttga acttgaaatt gctgagtccg atgtggaagt taaaaaagcg 1140 

gagcttgaac tagtaaaaga ggaagctaag gaacctcgaa acgaggaaaa agttaagcaa 1200 

gcaaaagcgg aagttgagag taaaaaagct gaggctacaa ggttagaaaa aatcaagaca 1260 

gatcgtaaaa aagcagaaga agaagctaaa cgaaaagcag cagaagaaga taaagttaaia 1320 

gaaaaaccag ctgaacaacc acaaccagcg ccggctccaa aagcagaaaa accagctcca 1380

gctccaaaac cagagaatcc agctgaacaa ccaaaagcag aaaaaccagc tgatcaacaa 1440

gctgaagaag actatgctcg tagatcagaa gaagaatata atcgcttgac tcaacagcaa 1500

ccgccaaaaa ctgaaaaacc agcacaacca tctactccaa aaacaggctg gaaacaagaa 1560 

aacggtatgt ggtacttcta caatactgat ggttcaatgg cgacaggatg gctccaaaac 1620 

aatggctcat ggtactacct caacagceuit ggcgctatgg cgacaggatg gctccaaaac 1680 
aatggttcat ggtactatct aaacgctaat ggttcaatgg Ceiacaggatg gctccaaaac 1740

aatggttcat ggtactacct aaacgctaat ggttcaatgg cgacaggatg gctccaatac 1800 

aatggctcat ggtactacct aaacgctaat ggttcaatgg cgacaggatg gctccaatac 1860

aatggctcat ggtactacct aaacgctaat ggtgatatgg cgacaggttg ggtgaaagat 1920 

ggagatacct ggtactatct tgaagcatca ggtgctatga aagcaagcca atggttcaaa 1980 

gtatcagata aatggtacta tgtcaatggc tcaggtgccc ttgcagtcaa cacaactgta 2040 

gatggctatg gagtcaatgc caatggtgaa tgggtaaac 2079 

SeqID 96

atgaactatt Cciaaagcatt gaatgaatgt atcgaaagtg cctacatggt tgctggacat 60 

tttggagctc gttatctaga gtcgtggcac ttgttgattg ccatgtctaa tcacagttat 120 

agtgtagcag gggcaacttt aaatgattat ccgtatgaga tggaccgttt agaagaggtg 180 

gctttggaac tgactgaaac ggactatagc caggatgaaa cctttacgga attgccgttc 240 

tcccgtcgtt tgcaggttct ttttgatgaa gcagagtatg tagcgtcagt ggtccatgct 300 

aaggtactag ggacagagca cgtcctctat gcgattttgc atgatagcaa tgccttggcg 360 

actcgtatct tggagagggc tggtttttct tatgaagaca agaaagatca ggtcaagatt 420 

gctgctcttc gtcgaaattt agaagaacgg gcaggctgga ctcgtgaaga tctcaaggct 480 

ttacgccaac gccatcgtac agtagctgac aagcaaaatt ctatggccaa tatgatgggc 540 

atgccgcaga ctcctagtgg tggtctcgag gattatacgc atgatttgac agagcaagcg 600 

cgttctggca agttagaacc agtcatcggt cgggacaagg aaatctcacg tatgattcaa 660 

atcttgagcc ggaagactaa gaacaaccct gtcttggttg gggatgctgg tgtcgggeiaa 720 

acagctctgg cgcttggtct tgcccagcgt attgctagtg gtgacgtgcc tgcggaaatg 780 

gctaagatgc gcgtgttaga acttgatttg atgaatgtcg ttgcagggac acgcttccgt 840 

ggtgactttg SLagaacgcat gaataatatc atcaaggata ttgaagaaga tggccaagtc 900 

atcctcttta tcgatgaact ccacaccatc atgggttctg gtagcgggat tgattcgact 960 

ctggatgcgg ccaatatctt gaaaccagcc ttggcgcgtg gaactttgag aacggttggt 1020 

gccactactc aggaagaata tcaaaaacat atcgaaaaag atgcggcact ttctcgtcgt 1080 

ttcgctaaag tgacgattga agaaccaagt gtggcagata gtatgactat tttacaaggt 1140 

ttgaaggcga cttatgagaa acatcaccgt gtacuatca cagatgsiagc ggttgaaaca 1200 

gcggtteiaga tggctcatcg ttatttaacc agtcgtcact tgccagactc tgctatcgat 1260 

ctcttggatg aggcggcagc aacagtgcaa stataaggcaa agcatgtaaa agcagacgat 1320 

tcagatttga gtccagctga caaggccctg atggatggca agtggaaaca ggcagcccag 1380 

ctaatcgcaa eiagaagagga agtacctgtc tacaaagact tggtgacaga gtctgatatt 1440 

ttgaccacct tgagtcgctt gtcaggaatc ccagttcaaa aactgactca aacggatgct 1500 

aagaagtatt taaatcttga agcagaactc cataaacggg ttatcggtca agatcaagct 1560 

gtttceiagca ttagccgtgc cattcgccgc aaccagtcag ggattcgcag tcataagcgt 1620 

ccgattggtt cctttatgtt cctagggcct acaggtgtcg ggaaaactga attagccaag 1680 

gctctggcag aagttctttt tgacgacgaa tcagccctta tccgctttga tatgagtgag 1740 

tatatggaga aatttgcagc tagtcgtctc aacggagctc ctccaggcta tgtaggatat 1800 

gaagaaggtg gggagttgac agagaaggtt cgcsuttaaac cctattccgt tctcctcttt 1860 

gatgaggtag agaaggccca cccagatatc ttteiatgttc tcttgcaggt tctggatgac 1920 

ggtgtcttga cagatagcaa gggacgcaag gtcgattttt caaataccat tatcattatg 1980 

acatcgaatc taggtgcgac tgcccttcgt gatgataaga ctgttggttt tggggctaag 2040 

gatattcgtt ttgaccagga aaatatggaa aaacgcatgt ttgetagaact getaciactagct 2100 

tatagaccgg aattcatcaa ccgtattgat gagaaggtgg tcttccatag cctatctagt 2160 

gatcatatgc aggaagtggt gaagattatg gtcaagcctt tagtggcaag tttgactgaa 2220 

aaaggcattg acttgaaatt acaagcttca gctctgaut tgttagcaaa tcaaggatat 2280 

gacccagaga tgggagctcg cccacttcgc agaaccctgc aaacagaagt ggaggacaag 2340 

ttggcagaac ttcttctcsia gggagattta gtggcaggca gcacacttaa gattggtgtc 2400 

aaagcaggcc agttaaaatt tgatattgca 2430 

SeqID 97 

atgaaaattt taccgtttat agcaagagga acaagttatt acttgaagat gtcagttaaa 60 

aagcttgttc cttttttagt agtaggattg atgctagcag ctggtgatag tgtctatgcc 120 

tattccagag gaaatggatc gattgcgcgt ggggatgatt atcctgctta ttataaaaat 180 

gggagccagg agattgatca gtggcgcatg tattctcgtc agtgtacttc ttttgtagcc 240 

tttcgtttga gtaatgtcaa tggttttgaa attccggcag cttatggaaa tgcgaatgaa 300 

tggggacatc gtgctcgtcg ggaaggttat cgtgtagata atacaccgac gattggttcc 360 

attacttggt ctactgcagg aacttatggt catgttgcct gggtgtcaaa tgtaatggga 420 

gatcagattg agattgagga atataactat ggttatacag aatcctataia taaacgagtt 480 

ataaaagcaa acacgatgac aggatttatt cattttasuig atttggatgg tggcagtgtt 540 

gggaatagtc aatcctcaac ttcaacaggc ggaactcatt attttaagac caagtctgct 600 

attasiaactg aacctctagc tagcggaact gtgattgatt actattatcc tggggagaag 660 

gttcattatg atcagatact tgaaaaagac ggctataagt ggttgagtta tactgcctat 720 

aatggaiagct atcgttatgt tcaattggag gctgtgaata aaaatcctct aggtsiattct 780 

gttctttctt caacaggtgg aactcattat tttaagacca agtctgctat caaeiactgaa 840 

cccctagtta gtgcaactgt gattgattac tattatcctg gagagaaggt tcattatgat 900 

caaattctcg aaaaagacgg ctacaagtgg ttgagttata cggcttataa cggaagtcgt 960 
cgctatatac agctagaggg agtgacttct tcacaaaatt atcagaatca atcaggaaac 1020

atctctagct atggatccca tagtagttca actgtcggtt ggaagstaaiat aaatggtagt 1080 

tggtatcatt tcaeiatcaaa tggttctaaa tcaacaggat ggctgaaaga cggttctagc 1140 

tggtattatt tgaciattatc tggtgaaatg cagacaggat ggtteisiagga eiaatggtttg 1200 

tggtattatc tgggtagttc aggggcaatg aaaacaggct ggtaccaggt ctctggtaag 1260 

tggtattatt cttactcttc aggcgcctta gctgttaata cgacggtgga tggctacaga 1320 

gtaaacagtg atggagsiacg agta 1344 

SeqID 98 

atgaeiagtaa tctttttagc agatgttaaa ggaaaaggta aaaaaggcga aattaaggaa 60 

gt:accaacag ggtatgcgca aaactttcbt: atcaoiaaaga atctagccaa agaagcgact 120 

gctcaagctg taggtgaact tcgtggtasia caaaaatcgg aagaaaaagc tcacgctgag 180 

atgattgcag aaggaaaagc aattaaagca caacttgaag cagaagaaac tgttgtagaa 240 

tttgttgaaa aiagttggtcc agatggtcgt acctttggtt ctattaccaa taagaagatt 300 

gcagaagaat tgcaaaagca atttggaatt aagattgata aacgtcatat tcaagtacaa 360

gctccgattc gagcggttgg tttgattgat gtgccagtga aaatctatca agatatcaca 420 

agtgtaatca atcttcgtgl: gaaagaagga 450 

SeqID 99 

atgaagaaaa aaatcttagc gtcactttta ttaagtacag taatggtttc tcaagtagct 60 

gttttaaceia ctgcgcatgc agaaacgact gatgaceiaaa ttgctgctca agataataaa 120 

attagteiact taacagcaca acaacaagaa gcccaiaaaac aagttgacca aattcaggag 180 

caagtatcag ctattcaagc tgagcagtct siacttgcaag ctgauiaatga tagattacaa 240 

gcagaatcta agaaactcga gggtgagatt acagaacttt ctaaaaacat tgtttctcgt 300 

aaccaatcgt tggaiaaaaca agctcgtagt gctcaaacaa atggagccgt aactagctat 360 

atcaatacca ttgtaaactc aaaatcaatt acagaagcta tttcacgtgt tgcbgcaatg 420 

agtgaaatcg tatctgcaaa caacaaaatg ttagaacaac aaaaggcaga taeiaaaagct 480 

atttctgeiaa aacaagtagc asiataatgat gctatcaata ctgtaattgc taatcaacaa 540 

eiaattggctg at gat gc tea agcattgact acgaaacagg cagaactaaa agctgctgaa 600 

ttaagtcttg ctgctgagaa agcgacagct gaaggggaaa aagcaagtct attagagcaa 660 

aaagcagcag ctgaggcaga ggctcgtgca gctgcggtag cagaagcagc ttataaagaa 720 

aaacgagcta gccaacaaca atcagtactt gcttcagcaa acact8iactt aacagctcaa 780 

gtgcaagcag tatctgaatc tgcagcagca cctgtccgtg caaaagttcg tccaacatac 840 

agtacaaacg cttcaagtta tccaattgga gaatgtacat ggggagtaaa aacattggca 900 

ccttgggctg gagactactg gggtaatgga gcacagtggg ctacaagtgc agcagcagca 960 

ggtttccgta caggttcaac acctcaagtt ggagcaattg catgttggcia tgatggtgga 1020 

tatggtcacg tagcggttgt tacagctgtt gaatcaacaa cacgtatcca agtatcagaa 1080 

tcaaattatg caggtaatcg tacaattgga aatcaccgtg gatggttcu tccaacaaca 1140 

acttctgaag gttttgttac atatatttat gcagat 1176 

SeqID 100 

atggtaaaaa gacgtataag gagagggacg agageuicctg aaaaagttgt tgttcctgag 60 

caatcatcta ttccttcgta tcctgtatct gttacatcta accaaggaac agatgtagca 120 

gtagaaccag ctaaagcagt tgctccaaca acagactgga aacaagaaaa tggtatgtgg 180 

tatttttata atactgatgg ttccatggca acaggttggg tacaagttaa tagttcatgg 240 

tactacctca acagcaacgg ttctatgaaa gtcaatcsuit ggttccaagt tggtggtaaa 300 

tggtattatg taaatacatc gggtgagtta gcggtcaata caagtataga tggctataga 360 

gtcaabgata atggtgaatg ggtgcgt 387 

SeqID 101 

gagttgcgac ggctatcaag gttggtggac caggagctct attttggatg tggatggcgg 60 

ctttctttgg aatggctacc aagtatgcgg aaggactctt ggccatcaaa taccgcacca 120 

aggacgacca tggtgcag 138 

SeqID 102 

gactgtatca ggaaacaacc gttcacacgc gatgaaccaa acaaaacgtg ccgtaaaacc 60 

aaaccttcaa aaagttactg ttcttatcga tgg 93 

SeqID 103 

ggacaaagaa accctcgaag aiattgaaaga gttatcagaa tggcagaaac gaaaccaaga 60 

atatctaaaa aagaaggc 78 

SeqID 104 

cagaggaagc tgttcaaaat cttccaccta ttccagaaga aaagtgggtg gaaccagaaa 60 

tcatcctgcc tcaagctgaa cttaaattcc ctgaacagga agatgactca gatgacgaag 120 

atgttcaggt cgatttttca gccaaagaag cccttgaata caaacttcca agcttaceiac 180 

tctttgcacc agataaacca aaagatcagt ctaaagagaa gaaaattgtc agagaaaata 240 

tcaaaatct 249 
SeqID 105

ttggtgatta tagttttgaa aatccagtcc aaatcggaga cagactttat tttcaagaca 60 

tggccattta ttcttttgtc aaaaataata cctttaatgg tattggattg ccaagtctct 120 

atctcatgga cgaacaggga gactgtagct tactcaaagc ttttggctat caagacttta 180 

aagggagatt atcatgatgg acagtccaaa aaiaattaggc tatcacatgc cagcagagta 240 

cgaaccccat catggtaccc tcatgatatig gccgactcga ccaggatcat ggccttttca 300 

aggaaaggct gc 312 

SeqID 106

gagagactac cagcttttcc tagaagtctt tcagggagga agttggacca aggcggaiacc 60 

aaagaaaaag gctcggatgg aagaagtcct 90 

SeqID 107

agaaattgcc tctctacttg geiaaagctcc ticaaactatc acactgaaat caagcgtggg 60 

acagtccgac aatgtcttgg aaaagggcgc ttcaaagagg tttattctgc cgactacgct 120 

caacagtctt atgaaaacaa tcgcaagcgc tcggtcaaga aatcaagctt gaccaaggaa 180 

ctaaaggaaa agattctcca ctatcataac caaaaatttt cgcctgaaat gatggttatg 240 

gctaaagggg ttaacgtggg aatttcaacc atttactatt ggattcatca tggasiaattg 300 

gggttaagca agcaggattt gctttaccct agaaaaggaa aagcgcttaa gaaacaggct 360 

agcaccaact tteiaacctgc tggtcaatcc atcgeiacagc ggcctgaagc tatcaatctt 420 

cgcttggaga atgggcatta tgagattgat acggttctac ttacgagatc gaaaaactac 480 

tgcttgattg tcttgacgga tcgaaagagt agacatcaga tcatccgatt gattccaaat 540 

aaaagtgctg aggtggtcaa tcaggctcta aaactcatct taaaacaaca caagattctt 600 

tccatcacgg cagataatgg aacggaattc aatcgcttgt ttgatatatt ttctgaggag 660 

cacatctatt atgcgcaccc ctatgcctct tgggasiaggg gaactaatga gaatcacaac 720 

aggctcattc gtagatagtt acctsiaggga accaagaaaa tgactcccaa agaagtcgca 780 

btcatcgaaa agtggattaa caactatcct aaaaaatgct tggactacaa gtcacccaga 840 

gaagacttct ggatggctcia cttgaacttg aaatttagca aaatggaaat aatttttatt 900 

aaacgcttcG sia 912 

SeqID 108

cctgtcatga ctatctcatc gcctactatg aaiaaacatgg atttgtcaac gaaggccagt 60 

cccagtcaac ctttgcaggg gaaacatggt atgatatggt ctgggaaa 108 

SeqID 109

acatcatcaa tsiaggataca cacsiaggaaa tcatcgccaa actggactac gacgccccat 60 

cttgccctga gtgcggaaac caat 84 

SeqID 110

tacttccttc cacacaagta tgccagagaa agcttatcgc taccctctac csiacaaaata 60 

ttacacagaa aacaaggttc g 81 

SeqID 111 

gcagccttca aaaaagatca aattaatgag cgtgtcgaga aattaggtaa gttaaaacct 60 

attacaataa attacaacgg aaaaticagaa gtaattgata gtaaagaaaa attacaagag 120 

cttatgaata aagccgttaa agacgaagtg gctcaaata 159 

SeqID 112

gcttatgcgc attctaaaag aagcgctgga agtggcaggg caggaggcag acaatgtctt 60 

tgccaatgtc aasLataaatg taggagagat tttaagtat 99 

SeqID 113 

cacggccgac cttatcataa gcctcaccaa ccgcatcatc acgggtttcc ccaacaatct 60 

tataatctcc tgcctccgaa acatEiaacca actctgtgtg tccgccgc 108 

SeqID 114 

aaaggaaaaa tcctcctgct accaaggcta accactcaaa gatggcaaag aaaaatccgc 60 

cctgactcac gtaagtcagc aaataataaa gcaautccttg acttccataa tagtcgctgt 120 

aaatcttccc tgtctgatga agcgcccaac ctgcataaaa atcctgcact tcttgtgcac 180 

tcattaagtc gagtaatagc ggtactccta gagttatccc cgttacaagc gtactccata 240 

gtaaaatttt caccaaagga agacgacttg attcacgatg atgcgattct tgttcgattt 300 

ggtattctag aggttcacga ttctccttat gaacttcttc tactctacca tacacactca 360 

tatcgtttct cctgttcaat ttatctgtct 390 

SeqID 115 

tttacggtaa gccatgtatt cctcctttat ttatctttta atccaagacc caaatcaatg 60 

agtttgagtt tcacttcttc caaactcttg cgtccaagat ttcgtacttt catcatctct 120 
gcttcagatt tttctgtcaa atcatgcaca gtattgatac cggcacgttt taaacagttg 180

tatgaacgca cagacaagtc cagttcctca atcgtacgat ctaaaatacg gtcgtcagat 240 

tcagtatcag cttctttcat cacttcagtt gacttagcaa tctcagtaag atttgtaaac 300 

aaatcaagat gttctgtcaia aatacgtgct gaaagcccta aagcatcttc tggaataatt 360 

gttccal^ttg tcaagatttc siagggttaat ttgtcgaaac catcattgct acctacacga 420 

gcaggttcca ct 432 

SeqID 116

cttgtctgca tgeiagaataa gggctgctac aaggaaagaa acaactgctg ccac 54 
SeqID 117 

ttccattatt tgtcasuEiata ctttttagtt tcagcaataa cgactggcga caagaccaag 60 

agggcaatca agtttggcag agccatcaag gcgttaacga tatctgcgat aatccagacc 120 

atatccaact cgataaatcc tcctaacaag accatgagca caaaaaccac acgg 174

SeqID 118

ggaaagaagg tattcataaa ataccctcta tcaagagtct cctcaaaaac aggaccgatg 60

attacaggca ggacaeuiaga tctagatagtc gataaaaagg ttggttgtcc atttgaaaaa 120 

agcacggtaa aatactcatc a 141

SeqID 119 

tcttcaccag tttttcctaa acttgtaatg gtatctgggg caaataaacc aagagaaagg 60 

cgcaatttcc cattttcgtc tsiaaatgtca ttccacttaa cctttgtctt g 111 

SeqID 120

tacttaactt ccttctcagt tccgaagata gcttcttcsia aggtcaaatt gacacgatac 60 

tggagatcat ctccttggcg aggagcgttt ggattgcgcg aagaaccgcc tccgccgaag 120 

aaacttgaga aaatatcctc aaaaccaccg aagccacctg ccccattgaa accgcogaaa 180 

ccaccagctc caccaaaacc accattggcg cctgcagcac catactggtc a 231 

SeqID 121

cagtcatggc gtcctattcc agattcaaaa tgctatacac aagaaaaact cactatcccc 60 

attaaaagaa gaaaagacat caaggacttc taccacaatt ccatccaaag acacaaaaac 120

agccataaga gtcacctcct tgattcctat aggctgatta tsiacaagact ggctgaaatt 180 

gtacatgeiaa ataaaatcct aatagtactc attttgtatg tgactaatat tccgtctogc 240 

tccagaaggt acgaagtaaa tagagtt 267 

SeqID 122

ctgtttcgtt tttatcgtgt aattgttctt tatcgaggtt ggcatattta tcttttaatt 60 

cttgtgciatt tgcagtacgt tcaaaacgtt ttccgaaagg atcgattcct tgttcgcgga 120 

gcgcagccat tttttcacgg cgaacgatct gctggtcatt tagttcttcc atatgttctg 180 

SeqID 123

ataactcacc ctccactaaa ccctgagcat tttgtttc€ia gagtcttttc atctcttggt 60 

ttgaagtctt atcagccaaa agatgataga tttctgagaa agccttcaga tagtaggcat 120 

cctgeiatcag gtaatagcgg aaaatggcag gttctaaatt cccctcttgt aattgtaaaa 180 

SeqID 124

acattggcta aagcagtcgg tttgatgtat tctccaccaa ttccaccaaa accattctta 60 

ggccgaataa cgacagattc gtcttctata 90 

SeqID 125

cctggttctc cattttcaga gatttccggt gcaggatttt ttggtgtcgc gsiaacgaata 60

tttccacgtc caccacgacc accgtgggca acgataaatt cttgcccatg ttcaatcaaa 120 

tctgttaaaa ccttgccagt ctccgcatca cgaacagtcg taccttgtgg tactcgaact 180 

ctaaggtcct cagcaccacg accatgcatc cctttggtca tccctttttc accagaatca 240 

gccttgaaa 249 

SeqID 126

ataaattcct tgaccttggc cacatcctta tccaaaagaa gggcaccaag aaaggcttca 60 

aaggcatcac caagaatggt gtcacgattg cgaccacctg atttttcttc ccctttaccc 120 

aacttgataa actggtcaaa ctggcaatca cgcgcaaaac cagctaaact ctcctcaogg 180 

acaatcatag cacggagttt tgataggtca ccttcaggct ttttaggata ttttttatat 240 

agatattctg aaatcaataa ctgtagaaca gcgtctccta aaaattccaa gcgttcattg 300 

tgtgaaattt ttaagaggcg gtgctcattg gca 333 

SeqID 127

gcaataggaa aaaggaattg taaagctgaa tgccaatccc accacctgct tgaaaagcag 60
aagaccttcc agtcaagaaa gaccaagaga tatggggcaa gccccgaacc aagatataga 120

gaatcaagga agccaagatt gtcacaa 147

SeqID 128

cagccattgg gacactcgaa agccgaagaa catgagacta tctgttcgca taccttcgat 60 

aaccatracga ccgaaaccat accaaatcaa gtaaaiaggcc gtgatatgac ctcgtctgag 120 

actcttccat ^tccgtctaa aaatcagaat Cciaggcaaag ccaagcagat tcca 174 

SeqID 129 

ccttgctctt tacctgatta tgggctggtt ggttctggct atcattcctg ccattatcag 60 

tcaaacgaca cccgttttct ggagtctcat ggtaactggc ggactctgtt atacagttgg 120 

agctggattt tatgccaaga aaaaacctta tttccacatg atttggcatc tctttatcct 180 

agctgcgtcG gcacttcaat acatcgctat tgtttattac atgtaaaaaa gttgagaaat 240 

tcaatctcaa cttttttctt tacacatatt gataaagtac tggtgcaagc gcacatcatc 300 

agtcaattct ggatgaeiaag aacttaccaa catatttttt tcttgggctg caacaatttg 360 

attgttcact gttgc 375 

SeqID 130 

cgagtaaaag ataatcatct ggataagctt gtgaaagctc ttctaaaaag gcgttcatcc 60 

actcagtatt acatccacca gctattaaga aaaatgattc gcctgtatgg gcatcaacag 120 

ctccataaca atagcgaaat tctcgtatiat agtgactatg gacatgtgga cctactccta 180 

ttggagacca acaiagatccc agtttac 207 

SeqID 131

caagtcatca siaatagacat agcaactaca aataaaacgg aatctgtaaa gagccaaiagt 60 

gagagagaaa ageuiaagatt gacaagcagt aatatactaa aggttagagg gcgaccgata 120 

SeqID 132

gcctttaaga gttccaaggt cccatcactt gatccatcat cgaceiaagac atactcsgatt 60 

tctgettcca aatctggaag taaagcttcc agagcc 96 

SeqID 133

gataaaactg acccactggc taggaaactt cctgacaaaa gtaagccgtc aacttccttt 60 

tgcaccaaat cactttctcc cgttaacatg gcttcattga cttccgcaaa gccttccaaa 120 

accaaggcat cactaggaat ctgctctcct gcagaceiaaG gaatgacatc tcctagcact 180 

aattcttcag gattaagagc eiacttcc 207 

SeqID 134

ttatgccgat: taceiaacaca agcaaggcca cgagggtctg tgaccaatct aacgaagcaa 60 

aataaggtat atagatacct aaattatctc cgccagacgc aattgtcagc aatg 114 

SeqID 135

tgcattcaaa gcattggcaa tgagggacag tgcaaaggca atagttgtta cgtaggcaag 60 

gagattcatc ttgcccccat atccgatata gttggtcaca aaggcaaaga ggaaggcgat 120 

gatggaaatg atgatggccg ccaattttac ctgtttttgg ctcatttggt tgggtctgcc 180 

ttcttgcgaa gcttcccact tcttbatagc aaaggtataa atgaggaagg tgacgggata 240 

ggtaatgatg gccgccttat ttccaaggat ataatcaata gcaccggaca aaatggtatt 300 

aacaatiacca aagtaabbtc cccatttgct 330 

SeqID 136 

acgtccacga agctggttat cgatacgacg actttcatga cgttctgtac caataacaca 60 

aagtcctcca agttcacgaa caccttcacc ciagcttgatg tcggtaccac gacccgccat 120 

gttggttgcg atggtiaacgg caccacgttg accagcattc atgatgattt gggcttctct 180 

atagtggttt ttggcattca agacttcgtg aggaacacca gctgcaacca atttcttaga 240 

aatgtagtca ctagtttcaa ccgctactgt accaaccaag acaggttgac ccttttggta 300 

acgagcctta acgtcttcga caaccgcttt aaacttagat tcgatacttg cataaagaag 360 

gtc 363 

SeqID 137 

atctctactg gtgtaccgac ctgbtcgatg tatccattgt taaagactgc aatbcbatca 60 

gataaagtca aggcttcctc t 81 

SeqID 138 

ttaagtacca bgtccagcat aaagtcciatc ttgtgctctt taccgacaca caccattttc 60 

tcaaaatcag ccatatcacc aaaaagagga tccactgcca ta 102 

SeqID 139 

agctgctcat actcatctac caactccaag gcatgctcaa tcgtcggttt atcciaaacca 60 
acaatattca tctgtgtcac acccatctca gcagccaagg caatttcttc tgcctcacga 120

cctgcaaatt cttctacatt caagtgtgta tgtgtatcaa aaatcatctc ttctaacctc 180 

gttttctatc ttctattata ccaaaaaaga ggaggggcac ctaatttttc ggtttcccct 240 

cctctcttca atagagagct attctgctat cttttctatc cgatattgcc catctcctat 300 

tccacagtta gagacagaag agattggcta cat 333 

SeqID 140

gtaacatctt gcattgttcc tgcagttgct tgcggtgcac ttgttgtctt aggagcagca 60 

ttaggagcca ctggtctctt aggaactgta accatggcaa tggcttgcac tccaatcgtt 120 

tctgcttcct tcacatcttc catagttttc gcattttcaa tctctttgag agctgcttgt 180 

ttttctgctt ccactcttgc taaaagttct gctttttctt tatcagaaag cggtgcgcct 240 

ttgatttcat cctgcttgtc cttggctgct ttttcaatgg catttttagc tgattctttt 300 

tcagtagcta attgcttaga agcttgtgat 330 

SeqID 141

tattctcctt tcaaccactc cattctcata aggaaaacga cgaaaatcat aaatccaaac 60 

cccaaagcac cacgaatgaa ttggcgaagc aaggtttggt caaaccaacc tgtaaacatt 120 

tccactaacc ataccaagag tgacaggccg ataaagaaa 159 

SeqID 142

gattatttca agtttcgaac aacttttaca agattttcta cagtaaagcc atattctgcc 60 

aatacttttg gtgctggggc agaggctccg aaggtatcaa tacctagaac ggcaccatcg 120 

agaccaacat atttgtacca gttt 144 

SeqID 143

agaggcagac gtggattatg cgttgcacga atcaaggctc ctagactagt cattaaacct 60 

aagagaacaa tcgatccgcc taccaaagat agatacagtc caccactctc agctacatcc 120 

ctctccgtcc ccaaaagtcc tatcatctct ttcccagcga agatggacaa asuitcctaaa 180 

aggaaactta atagtaaggt aatcttcaac gcctcagtca ca 222 

SeqID 144

actcctccat ataccaaaat tcctgccaaa acagctataa taccatittat ttcagctcaa 60 

gatttcaacc aagccceiacg gctctctgga 90 



SeqID 145

MSRNXVQLNNSFXQNEnTQRRRYLMKERQKRNRFMGGVLXLXMI^ 
TAFATKLKDEDYAAKYTRAKyYYSKSREKVYTXFDIiIiQR 

SeqID 146

MDKKKIiLLXIDGSSVAFRAFFAriYQQIiDRETan/AOI^^ 

K^PDEFREQFPFXREIiWHBflGXRHyDIAQYEADDXXGTIiDKIJ^DGro^ 

PDYIiMEEM6IjTPAQFXDIi!CALMGa>KSDNXPGVTKTOEKT6XKIJJ^ 

TXDTKAPXAXGIiEDIiVYS6PDVENIj6KFYDEM6FKQIiK0AIjWSSA^ 

mVGFVWS0 GDKL YftTDKIiEIJ[jQDPIFKDFIiEKTSIiR\TOFK3^^ 

Y6QTYIiVDDETFyGKGVB:KAXPEI^KFZiBHIJ^CKIiAVIiVBTEPXX^ 

IiIiBMQAENELVXBKLTQEXYBl^EEENVNSPKQLGWIiIOTm 

AKXQSTYVXGLQDWXIJUXSKXKTRYVQDLTQTGRItSSVDPHZKd 

XSKDEHLXKAFQEGaDXHTSTAMRVFGXERPDDVTA£n)RRmKA^ 

KtramBVVREARDKGYVBTLFiamREI.PDINSRNFN^ 

DB^^n:lSVP]CSSLVB^QCRlJVKQTHBBAXQIlSVPI■XJU3BNEGATWYE2l^ 

SeqID 147

MGMAAFKNPmQYKAXTXAQTIjGDDASSEEIiAGRYGSAVQCTEVTASNLST^ 

YFYESGDVKTGWVKTDGKWYYIiNDIiGVMQTGPVKPSGSWYY^ 

EAGXMKTGWFICSrGFHWYYAYGSGAZiAVSTTTPDGYRWCa^ 

SeqID 148

MSRKSIGEKRHSPSMRKLSVGLVSVTVSSFFIJflSQGXQSVSMNMESPXHY^ 

LVyRSQQFLPira5FNPTVGTFLFTAGIiSLLVLLVSK3lENSK3CRL\^^ 

HLPEPLKXEGYQYXGYXKTlUCQDiaTELSRTVDGKYSJUSRDSQPNSTKTSDV^ 

X2yU3MLSSiroSFASCVEQNPDHKGESVVRFTVPEQGNF7SATTVQSAEBEVI^ 

PLFVn rTKP IiBTKGTOGPGHEGEAAVHEBEPAYTBPIATKGa?^ 

PAIiEVTTiamTEXQNIPYTTEEXQDPTLDKNRRKXERQGQaG 

KPTVEITNLTKVBNKKSITVSYNIiIDTTSAYVSAKTOVFH 

ENTETSTQDFQIiBYKKXBXroXDSVEIiYGKENDRYRRYIiSIiSEAPTDTAKYFVI^^ 

VDQIiVEEGTDGYRDDYTFTVAKSKAEQPGVYTSFKQLVTAMQSNLSGVYT^ 

TKSYAXYDIiKRPIiFDll^NGATVRDIiDXKTVSADSKENVAAI^^ 
GKLIANHQDSimnMGGIViCaraTSffSSRVS^^

VGSTWQNGRVNNWSNVDVGDGYVITGDQyiUUVDVKimSTSVDNI^^ 

REVDYTRIiNKa^EaBRKVAYSOTBia^FYNKDLVVHyGNK^ 

NTVEYLDVTPBCBNFINSQVIBYNVTGKBYIPTPBAPVSDYTAITNNV^ 

EVKANIAEHIJUCVIAMDKSIimCTGVVBYVSEKIKraa^^ 

IVAIiGNSGZJDIiliRASOTVGLYANKIiASVK^ 

RKYSIKSVYDRISAPSWGHKSMIiIjPIiLTLPBESVYISSKMSTIiAFGSYERy^ 
DIliraililiDSASKBKLPRSVIVYDGFNVKDETGRTyWARL 
YGTSV7THEMVHNSDSAlYFEGNGRREGIiGJ\ELia^IiLQSVDSVNSH3^^ 
YMHGSYDVMTTXiDAMEiUQ^IiAQNinJVKKS^ 

DDSREYKRlK3yYTISMFSPVYAAI>SNSKG2U?GDIMFRKIAYEIiIiAEKGYHX^ 

VTDDIiVFiaCVFNGEYSSWJU:>FKKA14FKQRlDKQDmiKPI 

TPASWVHLLKQKIYNAYLRTTDDFRNSIYK 

SeqID 149

MKFNPMQRYTRWSIRRIiSVGVASVVVMGFFVLVGQPSSVRM)^ 

GtmVSLPTPTERTEVSEETSPSSIJXrLFEKDEEAQKNPELTDVLKETVra 

i^YLEK^GKGPFTiUSVNQVlPYEIiFAGDGHLTRIiLIi^ 

ALXDQLRANGTQTYKATVKVYGNKDGKADIiTNLVTlTKS^ 

VmVXPYELFAC3DGMLTRI<IjZjKASDKAPWSDNGI3AKN^ 

SATVNVYGEIXTCKPDIJDNlVATKKVTINXNGIiXSKET^ 

SOjTRIJiIiKASDKAPWSDNGDAKNPAIiSPI^^ 

DIiDNr\n\TKKVTIKXNGXiX3KEWQKAVADNVKDSIDVPAAYLEI^^ 
WSDNGDAKNPi^SPI^BNVKTKGQYFYQV!AIJ)(araiGKEKQA^ 
IHVI^SDTi^GSIiSPSNSGSGVTPMNHNH2^TGTTDSMPADa^ 
MASIGFWIiAIiAGIiIiGGXK3IiKNKR£EN 

SeqID 150 

MKSITKKIKATIiAGVAALFAVFAPSFVSAQSSSTYTVI^ 

VATPAPATYAAPAAQDETVSAFVAETPVVSETVVSTVSGSEAEAKBWI^^ 

QERVADAYVAGRYGSWTAAKNFWLNNGWY 

SeqID 151

MNKKKMXLTSIiASVAXLGAGFVTSQPTFVRAEESPQVVEKSSLEK^^ 

EKARKEABASQKIilQ>VAIiVVQNAYKEYREVQNQRSK^ 

ETKKKAEEAKAEEKVAKRKCTYATIiKVAIjAKKEV^ 

KLKKGEAEI^NAKQAEIiAKKQTELEKIiIiDSIiDPEGI^ 

AAXiQNKIJykKKAELAKKQTEIiElGLIiDSI^PEGKT^^ 

LQNKIATKKAEIiBKTQKEIiDAAIiNEZiGPIXS^ 

APKPBQPAKPEKPAESPTQPEKPATPKTGfnCQEM<»fWYFYNTDGSHAIGWIi^^ 

GAMKASQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYIifilANGDMATGW^ 

MATGWAKVMGSIimnijgANGSMATGm^KTCDTVVYYI)^^ 

SeqID 152 

MKKIVLVSIiAPIiPVLVOCGQKKETGPATKTBiamiQSALPVIBNAEK^^ 

QIOlPVSDEIiKTYIDQHGVEETQKAIiLEAEEKDKSIISARKIJ^E^^ 

EMIiIiKKEPSlCYISDRIiANGATEQ 

SeqID 153 

MFEVEEWIiHSRIGLNFRSGIiGRMQOAVDLLGNPEQSYPIIHVTGT^ 
NGQPXADaDFIRI<TDQVKEMEKTItIJQTPAQLSFFEIlLTI.VAFLYFREQE^^ 
QETI^SI^EAlAEQKAGXFKAGKICAVXAlCLPPEARIiACQKS^ 
ENAAIiAIiQTFIiIiFMRERKEAVDEQAVRKACiECSTI^^ 

roYQGMLGYIiTEKLPQVELKVTGFDYQGAIiDERDVTGYDXVSSYRBFISDFEERADAQDIi 
N 

SeqID 154 

VGXRVYKPO^GRRNMTSWFABXTTSTPEKSLLVAEiKSKASRN^^ 
DPNRSANIALVHYTDGVKAYXIAPKGLEVGQRrVSGPEADIKVGNaLPLANIPVGm^ 
SBGRYVLVRIiQSGEVEtMXLGTCRATVGWGNEQHGIjVNIiGKAGRSRWKG 
WGKPALGIJCrRNKKAKSDKIiXVRRRNEK 

SeqID 155 

MaJKKSMVAREAKRQKIVDRYAEKRAAIJCAAGDYEGIiSKLPRlIASPI^^ 
VTKASW 



SeqID 156
MDIRQVTETXA^IIS&(^VFDIRTXT^S6ISIJI^IDPDIMI^^

GJATDATDYVVIiMCALDKAAKEIgVDPIGGFSMjVQKGYQKGDEILI^ 

IIKETJUJIiSDMGVaKLVVPJ^VEDNPFMAGAPHGVQEM 

QLVGQMJVSERLGVEFGXVDXiSIJ^PAVGDSViaiVLEEMGIiETVGTI^ 

DBCa4IiUVVQNGSLNIiEKIiBl^AICSVGLDMXAlPED^ 

APVMKVNGASSVDFISRGGQIPAPIHSFKN 

SeqID 157

MVNTEVARTTIKTEYFGSLT&RMNKyREDVZiNKI^^ 

6NQASSNia3ftPIFPByTLEFVIiHEIJ:>I.FEK3U}GDVF7IT&ET^ 

KMETSGDAHIAVNyQKLIiQFGIiRGFEEIUUUCAKV^^ 

ELLEIlU^ICSRVPYEPATTFAEAIQSVHFIQCIXjQIESNGHSLSYGRFDQYl^ 

XNRVRSQSHTFSSA6SPLYQNVTIGGQTIU3Kia>AVm>LSyLVLKSVA^^ 

AFNNDEXXIPSFlAKGVLEDDAYDYSAlGCVETAVPGRWGyRCTGMSyMNFPKVLLI?^^ 

FSBLENANDKTIiRYLTRHSVIVENSlDIiSLERE^^ILCSiUli 

VFEEBRlSPSQLWHAZiETDYAGEEGKVlQEMIiIHDAPICYGNDDDYADI^ 

SISANVGQGRGTIiATPIX3RNA6TPIiAEGCSPSHNMZ3QmSPTSVLKSVSKL^ 

RTFFNRIiHGYHZQYim^SRETZiIDAQKHPEKHRDLIVRVAGYSAFFNVXiSKATQDD 

SeqID 158 

AHALLQVDPDFRDSLKRAjGOjLTRDSRKVERKKPGIi^^ 
SeqID 159 

I^KKIiTIKDIAEMAQTSKTWSFYLNGICYBKMSQETREKIEECVl^ 

GXEDIASQHGYQVMIGNSNYSQESEDRYIESNLIjLGVDGFIIQPTSNFRKYSRII^ 

AVYDMTQSCIEKGYEHFLI.XTAm'SRIiSTRXERASGFVDALTD2^MM^^ 

CWAIiPIiVFTVXKBWyNIiPQVGXiXGFDMTEWTCFSSPSVSTLVQPSFBEGQQ^ 

F 

SeqID 160

MNK GLFE KRCKYSXRKFSLGVASVMXGAAFFGTSPVLADSVQSGSTANIiPA^ 

DGPKTEEELLAIiEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQT^^ 

KKGLTVDANGlO^TVDXiTFKDDSEKGKSRFGVFLKFKDTKNNVFVGYD 

liKSDGQLZflASNinsVNIiFDTVTLPAAVNDHLKNEI^ 

TXQSKVIjKAVXDQAFPRVKEYSI^NGHTLP6CM2QFNQVF 

DNQIJZFDVTKXVNHNQVTPGQiaDDESICLIiSSXSFIiGNALVS 

YGFVSTDKUUUSVWSNSQNSYGGGSNDPmiLTAYKETVGraOTnr^ 

KMVDWQDGAXAYRSXMMNPQGWEKVKDXT^ 

ADXGKRIGGVEDFKTLXEKAKKYGAHIiQXHVlGkSETYPESKTFNEKXLRK^ 

EDLKKKLGDGIiDPXYVDVWGNGQSGDNGAStATHVIiAKEXNKQGWRFAXEWGHGGE 

FXRZmQKDAWVGDYRSYGGAANYPLIjGGYSMKDFEGWCySRSDYNGYVTNL 

KITOPEMRVELVDADNNKVVVTRKSinJVNSPQYRERTVT^ 

LPSDWAKSKVYLYKLTDQGKTEEQEIjTVimGKXTLDLIJVNQPYVLYRSI^^ 

KAEX VKSQGAXroMLRXQGNKBKVSLTQIOjTGLKPin^KYAVYVGW XTVim^EKEVTTYTNKSIJ^IiNYVKAYAHNT 

RRDNATVDDTSYFQBaMYAFFTTGADVSNVTIjTIiSRBAGDQAT^ 

VGGVBOVBDNRTHIiSEKmiPYTQRGWNaKKVDDVXEGNWSL^^ 

AFVVGKGEFQSGRRGTQASNLEMHEIiPNTWTDSKIU^KKAa^ 

NLQXEEXTLTGKMLTENAIiKSIYIiPTVAMTim^KBSMDAIj^^ 

FASLTAPAQAQEGIlANAFDGNVSSLWHTSWNC3GDVGKPAT^^a^ 

ATDWPNNiaKPKDXDFGKTXKAKKXVLTGTKTYGDGGDiCYQSAAEZiXFT^ 

VQASMKYATDNHMiTBRMVBYPADYIjNQLKDSATKPnAPTVEKPEFIM 

DTALXI1ASVSLAI.SAI.FVVKTKKD 

SeqID 161

MNKPTILRIiXKYLSXSFLSLVXAAr^rLGGGVPPYYVSKAPSLSESKLVATT^ 

VKAXVSXEDHRPFDHRGXDTXRXLGAPIiRlJLQSNSIiQGGSTLTQQIiimTYFSTSTSDQTXSR^ 

LTYYXNKVYMSNCTJYGMQTAAQNYYGIODLNOTiSLPQIiAIi^ 

AVNTPXTDGLQSIlKSASOTPAYMDNYIIKEVINQVEEETGymlLTTG^^ 

XVDVSNGKVIAQLGARHQSSNVSFGINQAVBTNRDWGSTMKPXTDYAPAIiBYG\^ 

FGNITIiQYAIiQQSRWTOAVETIjiNiCVGIiNRAK^ 

PMYIHICWPSDGSEKEFSNVGraAMFETTAYM^m)MMK^^ 

BLFAGYTRKYSMAVWTGYSIWUJTPLVGNGLTVAAKVYRSMMTYLSE 

PSTESSSSSSDSSTSQSSSTTPSTNHSTTTNPNNNTQQSNTTPDQQNQNPQPAQP 

SeqID 162

MSKKRRmtHKKEGQEPQFDFDEAKELTVGQAXRKNBBVBSGVLPEDSIIiDKYTO 
LIQEMREAVBKSBASSEEVPSSEDXLIJ»LPLDDEBQGIiDPIiLIJ5DBNPTBm!BBVEB 
IiVSVIICVSAyyvyRQV3^TKEIETS(^TTlUIQSDVDDEimiyDAFYTDSN!^
KSKYDSIiATQIKAIQDVZGlQFEKPAr\7IXmjyrNA^^ 

SSQASSinrrSEPKFSSSNETRSSRSEVIlMGIiSSAGVAVQRSASRVAYNQSAXro 
QYIIiKRVNIVNGNGyYNIiYKPIXSTyijPTIjNCKTC 

SeqID 163 

MKLIiKKMMQVMiAVFFFGIiIATNTW 

RD2n^FDNQPVNE16IjQEKWYyFGQDGAIiI>ECm)i^VIiEAKrSEmX3i^ 

IJNKIi6lm3DDSYNPLPI6EV3^GWTQDFHVTXDXDRSKPAPWYYLDAS6K^& 

YIlDI!nCE^6DMET6HQYI^NKWYYIJRSSGA^^^^ 

FASS6EWI 

SeqID 164 

MKILK^n^QVGLTVFFFGLLGTSTVFWDSEGWQFVQENGRTYYK^ 

GSTIGFYPNGXiajEGFPKSEWYYFDIQIQVLQBFVGWKTIiEIiCTKDSVGRKYGEK^ 

YIX2SMWYY£i2^EINGENYIiGGBRRAGWlin3DSTWYyiiDPTTO 



SeqID 165 

r^SKVYGVlUDGMNVEGRGSJ^IKDNVLlTAAHNYYI^ 
REYDIiALLIIiEEPIGAICI^TLGLPTSQKNLTGITVTlTGYPSy^ 
SHRVVGVHTLGTOANQXNSAViaiNERNLPFrsrS^ 
KVNGKWYYLNSNGAMVTGSQTIDGKVYNFASSGEWI 

SeqID 166

liMKKTFFJjLVZ^LFCLLPIiSVFAIDFKINSyQaDLyiHADOT^ 
AAKNGABIiADVTSEVTEBADGYTVRVyMPGQECTIVEVDIiVnN^ 
KIiFFHTGKLFREGTIEKSMZJ3YTIRIJ)lILPAKRGVEI<mYWPRTDFASARI^ 
IjPSXLSISIiIiI.SVCFYFrnUUCTTPSVKYAiaO!RLn 

DRGNVSXXSEGDAWIiRIiVKEDGIiSSFEKDCIiNIiAFSGKKEETLSNIiFiUDY^ 
IJTQMQBGVRKRVSPWGLPDYYRPIiTGGEKAIjQWMGZU^TXIiPLPIGFGXiFIiySI^ 
IJJNRIXSVliNKAGAEVyYIiWTSFBNMrJREXARIJDOABI^ 
HSTFyHSTAQMSHYASVANTASTYSVSSGSGSSGGGFSGGGGGGSXGAF 

SeqID 167

MKSXNKFLTMIiAALLLTASSIiFSAATVFAAGTTTTSVa^ 

TNTNNEX XDENGQTLGVNIDPQTFKIiSGAMPATAMKiaiTBABGAKF^ 

EXEI.PI^NDVVnAH\nrPKinrEAKPKXDKDFKGK&NPDTPRV^ 

XmSTVKVTVDDV^IiBAGDYALiTEVATGFDLKLTDAGIJVICV^^ 

TPKPNKPNBNGDIiTIiTKTWVDATGAPXPAGAEATFDIiVNAQTG 

QEXTTAGBXAVKNWKDENPRPLDPTEPKVVTYGKKFVKVNDKDI^^ 

AIiDRAV2UVYN^TAQQQTQQEKEKVDKAQAAinNAAVXAA^^ 

AGYAIiLTSRQKPEVTATSYSAOXKySXEYTAGSGKDDATKVVNKKXTXPQTC^ 

A 

SeqID 168

MAVMAYPIiVSRLYYRVESNQQIADFDRSKATLDEADXDERMKLAQAFW 
HVBXPVXDVDLPVYAGTAEEVLQQGAGHLBGTSLPXGGNSTimVITAHTGLPTAK^^ 
QVKVXEPTNFDDIiLXVPGHDYVTIiLTCTPYMINTHRLI.VRGHRlPY^ 
RIJUmCKQPEKAIiKALKAARKEVKVEDGQQ 

SeqID 169

MSRTKLRAIiLGYIiLMLVAaCiXPXYCPGQMVLQSLGQVK^^ 

NYQVSDDPDAVYOYIiSXPSIiEIMEPVYLGADYHHIiOTGIAHVDGTPLPLDGTGIRSVIAGm 

YDNGQEXVEYQMIDTBXXLPSEWBKLESVSSKNIMTIJITCDPXPTFNKRl^ 

RVATSQm<YRGLVVIiAFIi6XXiFVIiWKLARIiIiRGK 

SeqID 170

MKNPFFERR<32.YSXRKI^VGACSIiMXGAVLFAGPAIiZVEETAVPE!TSGA^ 

VA XASETA SPASNEAATTBTAKAASAAKPEBKASEVVAETPSAEftKPKSDKE^^ 

PKNTEKTLKPKEXKFIISWEEIjLKWBPGAREDDAXNRGSVVIJ^RRTC^ 

YAFDYWQYIiDSMVFWEGLVPTPDVXDAGHRNGVPVYGTXtFFBIWSNSXADQERFABA^ 

PINQETTGDLVKPIlGBKMRQFMIlYSKEYAAB™HPXKySWYDA^^™YGRyHQM 

KAKiroYTIATANWXGRNPyDWAGimQQGGSYKTKVKWroXIiDBNGiaiRLSI^ 

GDPTGQKPGDKDWYGIAmiVADRTPAVGOT'FTTSFNTGHGiCKWFVDGKVSro 

FTDAYNGGNSICTSGDV AGKT DQDVRLYSTKLBVTiaKTiCLRVAHKGGKGS 

FDLSSIiAGKTXYAVKLFPEHBGAVKDYQPNIiGQIiTXSDNHQBPQSPTSFSVVKQSLKNAQEAEAWQre 
GDSWKLLTGSSSTTXYLPKVSRSASAQGTTQELKVVAVGKNGVRSEAATTTPDWGMT\^ 
KTEGGEGIEGMIimTlTSIiSDKIVSSAQIiSGSVDXRLTKPRTVVRWVMDI^^

VRGNKAHVTOri^KPITAQDWiajNVVTSDNGTPnK?^^ 

ITVYDNFNSQTPIATZjKSEVGGDIiASAFIjDLTNQSGIiIiTniTQLPGra 

IiDIiRG6VIiRVQYBG6TSDKLZIU:iTHA6VSVS6FZ>THHK6E(^TI^ 

YIjVGDSIiDIiSEGRFAV&TSinyrMBEHSFTDSGVBISGyDAQKTORQT^ 

AKNKVVYNFASSSVKEAFIiKAIEAAEQVIiKDHETSTO^^ 

HPS6SAIiia»LIiEia9KALVEKVDIiSPEELTTJaCQSLro 

KKYFAGEDAHVFEIBGIiDEKGQDVDIiSYAS IVKI P I BKDKKVKKVPFL PEGKEAVELAFEQTDSHVI FTAPHFTHYAFVYE SA 

EKPQPAKPAPC^tn'VIiPKPTYQPTSDQQKAPRZiEVQEEKVAFH^ 

TEVXQBAIPBrVEXGTKVKTVPAWATQSKPAQNTAVKSEEASKQLPm!6TAX>A^ 

SeqID 171

MSXTSFVIO&IQDXTRNDAGVNGDAQRIEQMSWIiLFIJCXyDSREMVWELEEDEYES^ 

FVNNKIiFKEIJCEIiEXTSimPXRKTXVKSAFEDANNYMKNG 

FYTPRAATDFXAEVIjDPKI/SESMADIiACGTGGFLTSTIJnaiSSQRKTSEDT^ 

KXVHGNTIi&KimiEYTDDEKFDXXBlMNPPFGGSBIiETXKtni^ 

KTRIiKQKIiVDEFmjn^XXRLPHSWAPyTGXHTiaiiFFDKT^^ 

EXXiEGKFYKSKSFTPSBIiAEL£miIjIKKS)FPKEBEBXIiW 

SeqID 172

MNNTEFyDRI^SKNASADEXKKAYRKLSKSCYHPDXNKEPGA^ 

GGFNGAGGFGGFEDXFSSFFGGGGSSRNPNAPRQGDDIiQYRVNLTFEEAXFGTEKEVK^^ 
RCHGAGVXNVmW'PIJGN^!RRQVTCDVCHGRGKSXKYPC^^ 
GDIiYVWSVEASDKFEREGTTXFYin^NXiNFVOAAIiGDTVDXPTVH^ 
VNVVTFTGIjin3RQKVAIiKEFAAAGDIiKVNPKKKGFF^ 

SeqID 173

MNPNLFRSVEPYQRRYHWSrAOWjXXPLSIJ^PTFXLXFSLVATKBIl^ 

GDI^XKVSETMEESQKTAIiATQX^RliEKQKEGLGXLKQSLBKATDLFSGBDBra 

ANIiSNSSSSAXEQEXTKVQQQXGBYQEIJa>AXXNNRARZiPTGNPHQS 

ASIiKXQQAGXGSVATYDNSIATKXEVLRTQFIjQTASQQQLTVENQLTEIjK^ 

RIPTGTEIAQXFPVITDTRBVIiXTYYVSSDyiiPIJa>KGQTVRIiKIiBKX 

SNEDSKZiXQYGIiQGRVTSVTTKKTYFDYFKDKXX.THSD 

SeqID 174

MSKKIiNRKKQIiRNGIiRRAGAFSSTVTRinmETKKVV^ 
IiDRAKTFYDEGXKSASDFKNWTBKEIiIiALKGXGPATXKKIiKEN^SXI^ 

SeqID 175

LISLPGIiAAAKPVQADTSXADIQKRGELWGVKQDVPNFGYKDPKTGTySGXBTOLAK^^ 
NEQTOMDIATFTXTDERKKLYNFTSPYYTDASGFLVNKSAKXKKXEDIiWGKT 
YPELXTSLHAHRXDTFSVDRSXIiSGYTSKRTALIiDDSFKPSDYGXVTKKSmmiroYI^^ 
SHTAD 

SeqID 176

MSNXSLTTLGGVRElKSKNMYXAEXGBSXFVIiNVGIiKYPBNEQI^^ 
EAKVPVFGSEIiTIEIiAKIiFVKGiroAVKKFNDFHVXDBOT 

ASBSYATDFARIAEXGRDGVIJUjLSDSANADSiraQVASESEVRDEXTQTXADWBG 

VLTGFDIENXVRTAIRIiKKLSLANBXLLXiaPKDMSRFEDHELIXLE^ 

lAKBAFVARVBimXyQAGGVVKIiXTQSIiHVSGHCammLQIiM^ 

GTTMAYENGDFVPAGSVSA<a3XIiIDGNAXGDVGttIVVLRDRKVLSEDGlPIVM 

RESSELXNQTVEBYIiQGDDFDWADIJCGKVRDmiTKYIiFDQTKRRPAXLPV^^ 

SeqID 177

MIOCSTVItSLTTAAVXIiAAYAFNEVVIiADTSSSEDAIiNXSDKEKVAENKEI^^ 

XDNtn'SNEEAKXKBEKrSNKSQGDYTDSFVNKNTBNPia(mK\n^ 

TTPD NUDKX KQXBGISSVBRAQKVQPMMNHARKEXGVEBAXDYIiKSXNM 

ASMRFKKBDLKGTDKNYWIiSDKXPHAFNYYNGGKITVEKJTODGRDYFDPHGMm 

SYKMYSDAGSGFAGDETMFKAXEDSXKHNVDWSVSSGFTGTGIiVGEKTWQZ^RAI^^ 

NOTIiKMTDTGNVTRTAAHEDAXAYASAKNQTVBFDK^^ 

LIGJ^LRGKXAVt^IYTKDIiKNAPiaCAMDKG^ 

NPDKICTEVKRiaKKBDFiaDKLEQYYPXDMESPNSNKPNVGDEKBXD^^ 

APGKNXKSmNVXNGKSTYQYMSGTSMATPXVAASTVLXRPia^KEMLBI^^ 

WKEKSQYFASPRQQGAGLINVANAIiEfflEVVATFKOTlJSKGIiVNSYGSXSLKEXTO 

DSLTDRLKLDETYroBiaSPDGKQIVPBIHPEKVKGi^ITFEHDTFTIGWffSSFDI^ 

EALNSNGKKXNFQPSLSMPLMGFAGNWOTffiPXIiDKffAWEBQSRSKTI^^ 

RKDKaniTSIiDQNPEIiFAFNNEGINAPSSSGSKIANXYPIjDSNGro 

LKVISRBHFXRGXIiNSKSmAKGIKSSKLKVWGDiaCWDGLXYNPRGra 
LTKDYPWQVSYIPVKIDNTAPKIVSVDFSNPBiOCKLITKDTYHI^^

TCEVEKNIiEVTOUSBGQSRNMCLDKDGNTIYBIK^ 

SSKYQKIiGBIAESKPKKIiGNGKBGSIiKKinTOV^^ 

BTGKRMBBTOYKYDDKCaillAYDTOTDLEyETEKI^ lEIKAT 

KYDPHSKTMTPDLYAimroiVDGIiAPMDMRLEV^^ 

LTKMBSGKIYSDSBKQQYLIiiaamiliRKGYMiKVT^^ 

GRSTQSVLMSMiDGFNIIRYQVPTFKMiroKGBAIDmGNLV^ 

LSMDKNYFNPSKSNKIYVIOTEFyiJlGKIfiDKGGFNWBIj^^ 

ANQFPDKVTDMDGNVYLQTCYSDIiNZ^VGV^ 

YIMGKErrrSFiroiKQIIDKTIiNIKlVVia3F3\iaS^^ 

EXjEKGYQFDGWBISGETOKTOAGYVINLSKDTPXKPVFKICCBBKK^ 

SQKSDSTia>VTATVIJ3KNNXSSKSTTNNFNiaUPiC£GTASG^ 

SeqID 178 

M6KGHWlmmvySIRKPAVGACSV^!IGTCAVl:-IiGGI^ 

SEPSSTEAIASBKKBDBAVTPKEBKVSAKPBBKAPRIBSOASNQBKPLI^^ 

KEAIKPDADVSTWKmDLPYDWSIFNDFDHBSPAQNBGGQIjISK^ 

GHYPNGYNQFSYDITKYIiQKTCRENVIAVHftVOTCQPSSRWYSGSGIYRI^^ 

VTSKIVNTDDKDHELVABYQrvnBRQOHAVTGLVRTASRTIiKAm 

DAKiO^LFGYRYYHNTPMEGFSUGRSERXlCFHQVSZiHHDHGAIiGAEEl^^ 

GIiLVQEBAPDTWYGQKKPYDYORFFEimATHPEaRKGBKWSDFDIjRTM^^ 

VKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDAVGE^ 

HSNGPBI^NYEQSDYGMDRVGWGKTATASWTFDRDNAGYaGQFIWTGTDYIGBPTPWm 

I'YQSQWSVKKKPM\miIjFH!HlIWENKBIiASKVASSBGn 

LYLEWKVAYQPGTLEAIARDBSGKBIARDKITTAGKPAAVRIiIKEDHAIAft^ 

GQGQIiVGVDNGBQASRERYKAQADGSWIRKAFNGKGVArTOSTEQMKF^ 

KVQTI I GEAPEMPTTVPFVYSDGSRAERFVTWSS VDVSKPGrVTVKGMADGREVBARV^ 

KSVSYVIiIDGSVBEYEVDKWBIABBDKJaatiAIPGSRIQATGYLBGQPlHATO 

QYRTIiAYQAKIjPEVTASAKMAAVTVLQASAANGMRASIPIQPKTCGPIi^ 

SVRAHYQDGTQAVLPADKVTFSTSGBGEVAIRKGMIiEIjHKPGAV^^ 

QBPSIiPATVTVEYDKGPPKTHK^mJQAIPKBKIiDSYQrPBVIjGKVBGIDI^^ 

RTYDSNOTVSSAKVAWDAIRPBQYAKKGVPTVNGRLBOTQLTTKIiHVRVSAOT 

NVNDKLISYNNQPAiromTOWNRTNPEASVGVLPGDSGILSKRSVDOTi 

GNEDHVFNDSAMWKFVTNLK2VPAQIiFJ^EMN^ 

DliANFNPDLTOYYLESVDGKVPAWASVSNNGIiATVVPSVREGEPVRra 

RIiLQVGQAIiELPTKVPVYFTGKlXySTSTKDLTVEWEEVPAEN^ 

NSNQAFASATra3ZDKNSHDRVDYIjMD(a)HSBNRRWTNWSPTPSSNPEVSAG^ 

IjVIiERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDEINVTFI^ 

FIiAPSEIiPQEGTQSKlLVIXSKEIADFAENRQDYQITYKGQRPKVSVEEmQVZ^ 

HI.TKBKPVSEKTVAAVQBDLPKIBFVEKDI«AYKTVBKKDSTLYM 

RrvnjVGTKPVAQBAKKPQVSBKADTKPXDSSEASQTNKAQLPSTGSAA^ 

SeqID 179

MAPSVVDAATYHYVNKBIISQEAKPLIQTGKPDRNEVVYGLVYQKDQLPQTO^^ 

VGAMGLVVLPSAQAVDPVATIAIiASRBGVVEMBGYRYVGYIjSGDIIjKT^ 

TENQVVETEEAPKEEAPKTEESPKEBPKSEVKPTDDTLPKVEEGKEDSAEPAPVEEVGGEra 

ESICVEQ2^EFVAPREDEKAFVEPEKQPEAPEEEI(JVVESTPKQEESTPD1^^ 

BEPKVEQAGEFVAPREDEQAPTAPVEPEKQPEVPEBEKAVEETPKPEDiaKGIGTKBPVD]^ 

YNZ^PVXiETAKGVYASEPVKQPEVNSBTNKIiKTAID2^I^^ 

DAKQSEV2IEAVERIiTATlEKLVELSBKPII.TLTSTDKiaiiEREAV2k^^ 

TETXSAAFKKnCiBYYKEYTLSTmEYDRGNGBBTETIiENQNXQI^I^^ 

IiiaTSNNQKTTIiIiAVKNIBETTVNGTP\nrKVTAIADOTiVSRT^ 

YRIjGQSMSARl^nrVPllGKSYITBSFTGBSliIiSSEGKQFAITEI^^ 

VZTNViaTGTIiSGRNNVAGFVNNMEroGTRIENVAFFGKIi:^ 

VDYGLTIJ5HLIGTKAIiLTBSVVKGKIDVSNPVBVGaaMKT^^ 

VEGYSSGNRSFRRSKTFTRI.TKEQAnAKVTTFNXTi^KLES 

XVYQGNKZ^NKEHHIiNTKEVIiSVTAMNNNEFITl^ 

DNTTIiVMDXKSXIiBSVEIK}SQTMYQHIiNRIjGDYRVZ^KDLYI^ 

IiNYNRQLFTNXDNMMI>WFXnATEDHVYXAERASEVEBXIQISKHRAFDI^^ 

ERIiGECKSXiBDXKDXVNKAADGYRNYYDFWYRIJ^DNVKQRLI^^ 

KYYNYNGTGAYAAXYPNSDDXRTDVKTVHIiEHVGEYGXSVYTHETTHVS^ 

DEFGSLGIimVFKRKXnMSNQWYXTDPKTIiKTREDXK^ 

NQWDKXRKLSQEBKNEI^QSVNDLVISQQIiMTNRMFCagGXYKPEAXSYm 

RLWGYYGYENGFIiGYASlIKYICQQSKTDGESVLSDEYXXKiaSNNTFNTXBEFI^ 

XiLTIjFKEAVKECDAETLKQBANGinCTVSMNMTVK^ 



SeqID 180




MNiORIiPSKMSLVTIiPIIiAIiPSQSVIiAEBNIHPSSaCBAS^^ 
IQINTSSATTSOWVKQDGAWYYFDGNGNLVKiaWQGSYyiiKATCi^ 
GK^IAQGBWVyDSSYOAWyYIlKSDGSYi«NAWQSNyYLKSDGKMAKGE^ 
AVNBWVDGGRYYVGMGVWKEVQASTASSSNDSNSBYSAAMKAra 

SeqID 181 

MKVIlXJFKNKKVLVWSIiAKSGESiUmijLD 

YraPMIBKAIiAKBIPVLTEVEIAYLISEAPI IGITGSiraKTTTTTra 

BLSSFQLMGVQBPHPEIAVITOTiMPTHIDYHGSPSEYVaiUCMMriQNI^ 

DGAYIiEDGQLYFRGEVVMJUVEreiGVPGSHNVBNMATIAVaKbm 

NIIATQKALSGFDNSKVVLIAGGi:iDRCa5BFDEItVPDITaiiK3C^^ 

GDVVIiIiSPAMASWDMYAHFEVRGDLFIDTVAEIiKB 

SeqID 182

VVDKDITKSETABi^VTTNIiVTQSKVSAVVGPATSGATAAAVa^ 
SNYVSEKIiNAKKVVLYTDNASDYAKGIAKSPRBSYKGErWV^ 
RGMGIDKPIVGGDGPWGEBPVQOATABKASNIYPISGPSTTVEVSJ^iUC^^ 
iUCNSGEIKMNLAKTKDFEGVTGQTSFDADHirrVKa^^ 

SeqID 183

MSXIiEVKNLSHGFGDRAIFEDVSFRlJIiKGEHIGLVGi^BGRSTE^ 

RDVIiRTAroELFKAEARlin:)LYMKMftEIXU^ 

RTKNOIiIiAKLIiLBKFDIIiIiliDBPTNYI^^ 

YASaCKSQIiEAAYERQQKEXiU>IiKDFVARimVRVATRB9Ma^ 

GYDRPLTKPIiNLTFERNQKVAXXGANGlGKTTUUKSI^ 

QAEVRAAUaiCGIiTTICHZESQIQVIiSG6B0AlC\^CIiIi^^ 

YSGHIDQIWDFNNIiT 

SeqID 184

MKKKTOKAKKWQLYAAIGZU^VVVIiGAGGIIiLFRQPSOT 
DEILVSVGDKVSEGQMiVKYSSSEAQAAYDSASRAVARADRHINEIiNQAR 

VGQEVSFTSKVYPDiOCSmSKIiSYXSDYPKiaVGEAASPAAGNinrGSI^ 
LVM3DSKNYVWXVDBQQKAKKVSVSl4GN3U^^ 

SeqID 185

SETimEXDSNFAGRXJn:LRAGV]J3ANDGXXSXiU3^^ 

SRBKXJ:iI^QDREIJaCKSJ:.YAAYXQNGEFKTSJ^LLTNlCXFI^^ 

SVTXFPSDYRXPATVIiXVGVAIiIJJTGYTSARIiGKAPTKTAMXRmiU 

SeqID 186

MKKiOjTSLALVGAFIiGLSWYGNVQAQESSC^XHFXNVQETC 

KH\n;iTDRVFRRIiKELGVQKIjDFXLVTHTHSDHXGNVDEZiLSTYPW 

SVXQNXTQGD2aiFQFGD^^3XQLYNYENETDSSGELKKXWDDNSNSLXSVVI^^ 

KFiraHHDT£nCSNTia:)FXKNIiSPSIjXVOTSDSIiFWKNGVDSEYVM^ 

PSFQAGWHKSAY<a3mrYQAPDSTGEYAVGWKrBXBGEWYYFNQTGXIiIi^^ 

ENQMEXGWXQDKEQWYYIiDVDGSMKT6WI^YMGK2WYYFAPS6EMKMGWVKDKET^^ 

MKQGWHKKANDWYFYKTDGSRAVGWXKDKDKWYFLKENGQLLV^ 

EVVKKDIiENKETSQHESVTNFSTSQDLTSSTSQSSETSVNKSESBQ 

SeqID 187

MDLGPTQRGXSVVSQSYXNVXGAGIJUSSEAAYQXAERGXPVKLYraaRGVKSTPQHE^ 

EMRRXKSSVXIiESABATRVPAGGAIiAVDRDGFSQMVTBKVANHPLXEVVRDEXT^ 

AGFYFYDA^lAPXXDVNTXDMSKVYIiKSRYDKGEAAYIiNAPMTKQ^ 

XKTMLYGPMKPVGLEYPDDYTGPRDGEFKTPYAWQLRQDKAASSItYNXVGFQTHLKWGEQro 

RNSYNDSPNIJ:.EQTYRSKKQFMLFFA6QMTGVB6YVBSAASGLVAGXNAARLFK^ 

PMNVNFGXXKBLEGERXRDRKARYEKXABRAIiADLEEFIfTV 

SeqID 188

MLXGXPKBXKNNBNRV^TPAG\niSLVSRGHRVLXETKAGLGSGFTDADYQI^^ 

liRDDXiLIiFTYZiHMAIVAPELADAMLTAKTTETVRDNQGQLPI^ 

TXXGGGVVGTHAARXALGLGAQVTXLDXSSKRLSVLEEVFGSQXQTIiMSN^ 

VKQMRPGSVSLTZiLLTK\ri^KQLTVQRTMNPSMKimrF 

CVKVIiLIKVTIjTYQIiLKDLIVTTIiXSMX 



SeqID 189

MKINIOmAGSVAVIJ^VCSyBLGRHOAGQVKKESmVSYl^^
GDHYHYYNGKVPYDAIISBEUjMKDPNYQMaJSDIVNEIKGGYVIKV^ 
GSMDQAVVAAIUVQGRYTTDIXSYIFmSDIIEDTGDAy^ 
AQPMiSENHNIiTVTPTYHQNQGENISSIiLRBIiYftKPIi^ 

RIARIIPLRYMNHWVPDSRPBQPSPQSTPBPSPSPQPjmiPQPAPSNPIDBKLVKEAVRKVGDGXvr 

i^TAAGXDSKIiAKQESLSHKIiGAKKTDLPSSDREFYNKAYDIiIJ^H^ 

IIJ^?IAPIIfflPERIiGKPN2^ITYTDDEIQVaja^ 

AYAKERCaTPPSTDHQDSGOTEAKGAEAIYNRVKAAKKWIiDRM^^ 

KGYTIiBDLIATVK^YVBHPNBRPHSDNGPCaiASDHVRKNKV^ 

STiyrEETEBEAEDTTDEAEXPQVENSVXNAKIJU>AEMiI^ 

KESQPAFIQ 

SeqID 190 

HECFSRK7IAAGSAVIVSZiSIiCAYMiNQHRSQEma>MNRVSYVI)GSQSSQKSS£^ 

GDHYHYYNGKVPYDi^SBEZJ;j4KDPNYQIJa3ADIVNE^^^ 

VNSNVAVAIU3QGRYTTNDGyVFNPlU}IIEI>TGNAY3^^ 

TQSVAKGSTSKPANKSENZK2SLLKELYDSPSAQRYSESIX3LVFDPAKIISRTPN^ 
PISGTCSTVSTNAKPNEWSSIiGSIiSSNPSSLTTSKELSSASDGYIFNPKDZVEEO^ 
XmSIiATPSPSIiPINPGTSHEKHEEDGYGFIXftNRXZl^ESGFVMSHX^ 
SSHEQDYPSNAKEMKDIJ3KKIEEKIi^XHKQYGVKRESI^^ 

NIANPEZxDQPyLPGQTFKYTrASKDYPEVSYDGTPTVPTSIAYKMASQTI^ 

HGNAYIiENNYKWElKLPZPKIiNC^TTRTAGinCIPVTF^^ 

DEEC^^EPKTSEKVHKEIa[lSETGNSTSNSTIlEEVPTVDPVQEKVAKPAESYG^n^^ 

GBAPQGNGEI!TKPSEN6KVSTGTVEN^QPTENKPiy3SIiPBAPNEKPVKPENSTI^ 

KIiEKFTASY6Ii6IiDSVIFN»QX3TXEI^PSGEVlKK29IiSDIiIA 

SeqID 191 

^flKXIJFVAAEGAPFSKTGGIlGDVXGAI■PECSLVKAGHBVAVILPYYD^^ 
FYFXDNQYYFFRGH^n^GDFDIX3ERFAFFQIlAAIBAMERXDFXPDL£lHVHD^^ 
FQGQFSE^OLGDLFGVGFERYADGTLRWNNCXi]^^ 
DADLYNPC2TDAIiZjDYHFNQBDLSGKAKNKAKIiQERA^ 

PAFEGAFSWFAQrYPDKLSTNXTFDVKZiAQEXYAACDLFI^SRFBPCGLSC^^ 
CTGFSFDNIiSPYmjmTFQTAIiDLYiaraPDXWRNLQKQAHESDFSNBTACKSYLDLYHSLW 

SeqID 192

MEKYFGEKQERFSFRKLSVGLVSATXSSLFFMSVXiASSSVDAQBTAGVHYICYVADSm^ 

VYKLNSQNQIiAEIiPimSSKNERQAIiVAGASJjAAMGXIjXFAVSKKI^^ 

EIiTSGEKLPLPKEXSGYTYIGYIKEGICrTSESEVSNQKSSVATPTKQQKVDYNOT 

VVEKPFSTEXjXNPRKEEKQSSDSQEQIiAEHKMIiETKKEEKXSPKBKTGVMTI^^ 

EXQENPDIiAEGTVRVKQEGKIiGKKVEXVRXFSVNKE]SVSREXVSTSTTAPSPRX\^^ 

VEPAXQPEXiPEAWSDKGEPEVQPTIiPEAVVTDKGETBVQPESPDTWSDKGBPEQVAP 

PBKTEEVPVKPTEETPVNPNBGTTEGTSXQEAENPVQPAEESTTNSEKVSPDTSSKN^ 

KZJBNSEKTVEBVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKXANENTG^^ 

GZSTTSENGQTEPEPSNGNSTEDVSTESOTSNSNGNBEXKQENEIiDPDKICVEBP^^ 

VPSNPNSYFVKVKSSSFKDVYIiFVASXSBERKNDRXLYKXTAKVBKIiQQEXES^ 

XNQNPSGTYHLAASLNANEVEIiGPDERSYXKDTFTGRLXGBKIXSKNYU 

LANEAQNNTKXKQVHVDGVLAGERGXGGIiIJ^^^ 

SimiTSDQTVGGIiAGI.VDRI>AQXQDSYABGDXNiraCHFGRVAGVAGSnjW 

EMKVKZ>TFSSKAmVYllVTIJVIa3BWSICESFBERGT^«£l^^ 

TYKNXEIOJIjPFYlilKATXVKYGNZiVNENSIiLYQKEI^ 

ABYSIiCa9TGIJ^YTPNQFLYIX2TSXXKQVLPDLQICVDYHSEAXRK1^ 

IiASANFVTEGYIiVDKXKRlQCET^IiLLGLTYLERWYOT 

TYGXSIlASQHGTTDLFSTIlEHYRK^^PMTSMIroWFKSETKAYXV^ 

VIiPZiLTLPERSVFVXSTBSSSIiGFGAYSRYRSSDHKAGKAIiNDFVBBNAR^ 

YKFGDDNTVGKATEVADFDNPNPAHQHFFGPV(anCV<3HNQHGAYA 

GRRSGLGPBFFAKGIJiQAPDHPDDATXTXNSXLKHSKSDSTBSRRIiQVIJDPT^ 

liKLimiQKQQIJiRKVTNBYHPDPDGNKVYATN^^ 

ALSimXGTPGDIiMGRRXAYEIiIJ^GFKDGMVPYXSNQYBEEAKQKGKTXMIiYGK^ 

MYQBRQDQFDRIiNK\n!FiaDTTQFWQTFAKK^ 

NDFRSSXFENKK 

SeqID 193

MKXNKKYIiAGSVAVIiAIiSVCSYELGRYQAGQDKKESNRVAYXDGDQAGQK^ 
GDHYHYYNGKTOYD AXXS BEIJJiKDPinfQLroSDXVNEXK^ 
RADNAVAAARAQGRYTTDDGYXFNASDXIEDTGEAYrVPHGD 
QPRIiSENHNLTVTPTYHQNQGBNISSLIJlELYAKPLSBRH\mSIX3LIFDP^^ 

lARXIPLRYRSinroVPDSRPEEPSPQPTPBPSPSPQPAPSNPIDEKLVKEAVRICVGDGYVPEBNGVSRYIP^^ 
DSKIiAKQESIiSHKIiGTKKTDIiFSSDIUSFSNKAyDZJJ^

XiaiFERIj6KPMAQITyTDDEIQV3aaiA6KYTTED67IFDPimXT^ 

LTPPSTDHQDSGMTSAKGAEAZYHRVKAAKKVPIiDRMPyNIK2 

IiIJVTVKYYVEHFNERPHSr»296FGNASDHVQR£nCNGQAiy^ 

BESBEFQVETEKVESKIjRBAEDUiGKIQDPIIKSIIAKBTIiTGL^^ 

SeqID 194 

I ISBELIjMKDFimCIiKDEDXVNEVKGGYVXKVDGKyYVyZiK^^ 

GRYTTDDGYIFN^DIIEDrrGDAYrVFHGDHYHlTJPKNEIiSASEI^^ 

P6TTIiTNTSNNSNTNSQASQSNDIDSIiIiKQLyiCLPLSQRH\rBSTC 

XAIUlPIjRYRSiraWVPDSRPEQPSPQPTPEPSP6PQPAPNIjim:)SNSSIiVSQ^ 

NIiESKLSKQESVSirniiTAKKENVAPRDQEFYDKAYinjLTEAHK/^ 

APXTHPERLGKPNSQXEYTEDEVRXAQIiADKYTTSDGYXFDBHDIISDECSD^ 

K6XLPPSPDADVK3^NPTC3}SAAAXYNRVKGEKRXPLVKLFYHVEHTVE^ 

EDIJ?ATXKYYVEHPDERPHSNDGWG10^SEHVIiGKKDHSEDPNia^^ 

TDSSIiKANATETIiAGIJaanJTIiQXMDNNSXMAEAEK^^ 

SeqID 195

MPVEXKmKBXHPKIYAYTTPTVTSNEGWIKIGYTERDVTQRIKEQTOT 

SPHDVERRPKTEWPYFNGTPEKSKNLPDKFVC2HDI*SGYQPGKGQDYTrjRQBQE^ 

STYDIiAIU^MBAVNVIiXVTMKPAXi^SWYDDFETFXAGQTT^^ 

VYLGGEHDKLKWVTDLHSmiiLVXDEAHEGVDTFKTDai^F^^ 

SIiEQBBB NPY ESLPQIiNLFTYQMSQMIGEKIiEKGAQXDGEtm>YVFDIiSEPFATO 

KEZjRNEXiIOrrFWIiIiERVASAKAXiKAIJiEEHPXYEN^ 

VTXPEWTGVIjMIiSNIiBCSPiU^YMQAAFRAQNFySWSDNKGKtH^^ 

ENXRELI^ETFFPXXi^RAGKMVBXDAKAVLTTPRQXKAREV^^ 

SDLLDFSDVTVDDEGNAVVDHBXVVNQQMRLFGEKVyGLGBSVAELVTKDE 

ETEQXKKQlTATLENEXRKNDXBRiaSEAIIXKQEIjQQQLIQ5A19DK^ 

KFIEQVEIKRVEQLKQ SAQD EXRDHljeGPARTIPSFIMAYGDOTLTIi^ 

LFDKATPDBAIQEFIiRKiaCEIiADYFKDQKBDXFDYIPPQKTOQXFTPKRVVI^^ 

lAEIiVKRLYNSNSLKEAPPNPEBRIiKHXLEKQVYGFAPSBXIYNISTMFIPGNLSKDISRK^^ 

YFENN 

SeqID 196

BflKKXLrVDDEKPXSDXXKENMTKBGYEVVTAENGREAIiBQPEAEQPDXXIIJJLMLPEITC 

EPDKVXGLBIjGADDYVTKPFSNRELQARVKAIJjRRSQPM 

LYHIiASHTGQVITREHia*BTVWGYDYFGDVRTVDVTVRRIiREKIED^ 

SeqID 197

MKKKPLAFLLILFPIFSLGIAKAETXKrW^DTAYAPFEFKDSI^^ 

AnAXMAGMTKTKERBKVFTOSDTYYDTKVVIATTKSHKISIOT 

NSLS^^inAMMDDKPVXBXAINQGQDLHIEMDGEAVGSFAFGVKKGSKYEHIiVT^ 

AVPTTTTLAGLKAIPVKAKYXXASDSSFAPFVPQNSSNQYTGIDMErilKAIAI^ 

<3«SVTDARKATFDFSBSYYTANTIIiGVKBSSNIASYEDIjKGKTVGVKNGTASOT 

A^LJTODBPVLKYSISQGQKLKTPXSGTPXGETAFAVKKGAOT 

DETTIiWGLLQNNYKQIiIiSGLGITIiAIiALISPAIAIVIGI I FGMPS VS PYKSIiRVISEIFVDVIRGI PIiMIIiAAFIPWGI PNPI 

BfilTGQQSPiraPVAGTIAIiSIiNAAAYXAEXVRGGIQAVPVGQMEASRSLGISYGK™ 

IJTTrVSAXGLVBIiFQTGKIXXARNyQSPKMYAXIiAXFYLVIXTIiLTRl^^ 

SeqID 198

MAFESLTERLCJNVFKNIMKGKXSESDVQEATiaSIRIAIiEADVa^^ 

TAVXiGSDTAElXKSPKXPTXIMMVGLiQGAGKTTPAGKLANKIJ^ 

VPAVBlVRQGLEQAQTiraNDYVLXDTAQRIiQXDBIjLMKraLRDVra^ 

KIDGOTRGGAALSVRHITGKPXKPTGTGBiaTDIETPHPDRMSSRXIiGMGDP^ 

DPIDQIJDQVQIaMGPMEDIlLKMIPGMaNNPAI^C2NMKVDERQIARKRAIVSS^ITPE 

PXKDPNQAKQLMQGVMSQDMNKMMKQMGINPNNLPKNMPOTISGMDM^^ 

XGEFAMKQSMKRMAMKMKKAKKKRK 

SeqID 199

MSQIWTKEKFISQVQGGVIVSCOMiPCSBALYNEEPSLMPPMAE^ 

EPYITATMKEVDBI.VECGTTVIAFDATIiRPRYDGLWSEFXKia:KEKYPNQI^^ 

TSVQSDEPDFBIiMKKIJU)PNXPVIAEGKIHyPEQIjKKAYSLGVTSVVIGGAIT^ 

SeqID 200

MRYLTAGBSHGPRX,TAIXEGIPAGLPLTAEDINBDIiRRRQGGYGRGGRMKIEiroQVVFTSGV^ 

WIiDIMSABDXEDRXJCSKRKIlHPRPGHADLVGGXKyRFDDLRNSLKRSSARETTD^ 

KEIDVPENLTVABIKQRAAQSEVSXVNQEREQBXKDYIDQIKRDGDTIGGVVETVVGGVPVG 
XIlAFK6VEF6Ii6FBA6YIUC6SQVBa)BIIJ9SKED6in:^
ATVERSPPTALPJU^<agVMB?^V^a^TVIAQBIIiEKFSSDOTlBmiK^ 

SeqID 201

MVVMNRXRVSKRVEKKIiAKGLVliIiEASDZiENVmi^^ 

AKEKRNAYYQDDLTTAFRIiFNQEGIXSFGGIiTVDLYGDYAVFSWyNSYVYQIROTXSBAro 
HVy6QEAFDFFNVLENGmjyQVFMim6IiMTGIFU>QHEVR6SL^^ 
SREIiSQAHFQaNGLSTDEHRFrmDVFBYFKYAKRKDLTro^^ 
STSOUUTVSRQKFTBQIDKGFAGRSYQXIJmYQIiPWFAVinCKDESSN^^ 

SeqID 202

^STKTIlKRPEVLSPA6TI.EKIlKVAVQY6A^ 

BWFRKLRDXGIAAVXVSDPALIMIAVTBAPGLBIHLSTQASATNYE^^ 

FVHGAMCXSYSGReETiSNHMSMRD2tf9RG6CSQSC3i^^ 

LKXEGRmSXHYVSTVTNCYKAAVDAYLESPEKFEAXKQDIiVD 

WSYDDAAQTATXRQRNVXiaBGDQVBFYGPQFraFBTYXEDIiHDAKGMKXDIUk^^ 

IiYKEDGTSVTVRA 

SeqID 203

MNTYQIJINGVEXPVI.6FGTFEAia36EEAYRAVI£AIiK^ 
QTRQALEKSIBKLGIJYIiDIiYIiXHWPNPKPLRBMD^^ 
HQVRLAPGVYQDQVVAYCREKGXIiIiBAWGPFGQGELFDSKQVQEXAAl^ 
CFGXEIiSKBEREnjCTXAVQSGAPEtVDDVDF 

SeqID 204

LSEKSREEEKLSFKEQXIiRDLEKVKGYDEVl^KSDEAVVRTPA£3EPSTEBI^^ 

XQRETPGVPSHPSQDVl>SSPAEESGSRPGPGFVRPICia^ER£YNBTPTRVAA^ 

RSRRBG2UCPVKPKKEKKSHVKAPVXSPLVFIAIiIiSAGGYPGYQYV^ 

iCHGIiXFSFYAKYKNYTDIiKAGYYmiQKSMSTBDIiLKEIiQKGGTDEPQEPVI^^ 

AFIiAKVQDETFXSQAVAKYPTIjIiESIiPVKDSGARYiaiEGYI.FPATYSXKEST^ 

NBIiLTXASIiVEKBGAKTEDRKIiXAGVFYimiiNRDMPLQSNXAXLYAC^ 

DSPSLDAXESSXNQO^DinJYFVADVTEGKVYYAMNQEDHDRNVAEHVNSKI^ 

SeqID 205

MR^ERFPLVSDDEVMLTEMPVMNLYDESDiaSNXKGEYimKim:!^^ 

REEARADLKlCKRSAimiiTQDFSIiAIUUaSQPSLVRQGNQPTAPFQKBNPGBFVKY^ 

PK3CNNYDFI<KKSQrsrNKKSKQTEQERRVAQEIiNL^^ 

SeqID 206

MKKSKSKYIiTIiAGXiVLGTGVZjLSAOGNSSTASKTrYNYV^ 

WTVSQDGLTYTYKLRKDAKWFTSBGEEYAPVTAQDFVTCLQYAADKra 

TVQYTLVKPELYTOSKTLATILFPVNADFLKSKGDDFGKADPSSILYTO 

TYYDGSDQBSLBRNFTAGAYTTARLPPNSSSYEGIiaSKyKNNIIYSMQNST^ 

NKNFRQAXNFAFDRTSYQAQSEGKBGATKIIjRNLVVPPNFVSXKGiaOFGBVVASK^ 

KFAEAKKBIiEAKGVQFPIHIiDKTVEVTDKVGIQGVSSIKQSXESVLGSDNVVXDIQQM^ 

GWGPDYQDPSTYLDIPNTNSGGFLQjnjGLBPGEAmKAKAVQIiDVYTO 

SRGGTPSIiRRTVPFAAAYGIiTGTKGVESYKYIiKVQDKr«m!DBYAKA^ 

SeqID 207

VBQHSDVCYXFYRRBRIiKTKIGIASXCMiGIjATSHVAANBTBVM 

YYVKDlX3SKAQSBWIPDimrKAWFYINSDGRYSQNEWHGMYYI.KSGGYMAQNEW 

WYYFKKWGYMAKSQWQGSYFIiNGQGZkMMQNBWLYDPAYSAYFYLKSDGTYANQEWQK^^ 

GSG2^TDBVXMDGTRYXFAASGELKEKKDIiNVGWVHRDGKRYFFNNR^ 

GYIVRLGYSGBCEDKEIiAHNIKmNRLGIPYGVYLYTYAENBTDAE^ 

SDTGTmraCINKYMDTMKQAGYQNVYVYSYRSLLQTRIiKHPDXIOT 

RVDVSVWY 

SeqID 208

MAKEPWQBDXYDQEBSRABRRHRNHGGADRMAiniXLTXIiASXFFUT^nnn^ 
SSSSQPEQSSBPESTSSSSBE2U^B6TXKVIaAGEGBAAlAARAGXSXAQIJBAIiNPGH^^ 

SeqID 209

MPITSLEIKDKTFGTRFRGPDPBBVDEFIJJIVVRDyEDIiVRAiniDKNDLRXra 
AAHERSNNXI HQAB QDAQRLIiEEAKYlCANEIlJtQATDMftKKTOVBTBBI^ 

ATYIlQTSDBAFKBWSEVIIGEPXPAPIEBEPXD^mlQFSOaEMABLOftRXBVADKEM 
liLXQXiAQCMKMQK 



SeqID 210

MSLKDRFDRFXDYFTEDEDSSLPYEKRDBFVFTSVNSSQEPUPHNQPSQSJ^TKE^

VRYPRKYEimTBIVDIiLAtaiBSIIirDPQYMTEVOAIUlCLD 

GBF6FDMKRNRVR 

SeqID 211 

MSDZiIOCYEGnniPAFia^CyDDQGEVSPERTRALVQyFro 
CNNTKDSMEIiAIUI^SZiGVDAXATXPPXYFRIiPEYSVAKYWNDlS 
KNSS^4PVQDIQTFVSIJGGEDHIVF^IGPDEQFI^RIJ4GARA6I^ 
TSAHGNMYGVlKBVIiKlNEGIiNIGSVRSPLTPVTEBDRPVVE^ 

SeqID 212

MNKIU3I>YSKL6ISVVGISLIiMGVPTLIHANEIiN7 

U>iU^FGXiSNSKiU^FlCNNYFSIFMIU)SGE16VEIRI]AQI^ 

FSETVDTFIiPISNIM6IDICATIX3AVimE6KEHnAK^ 

YTLSSGRVIiSSIDARYGGTHDSKSKINIATSYSDDinSKTWSBPIFAMKF^ 

DICECSOKTIIJADVMPASIGNNmNKADSGFKBlNGHyYL 

SLTVEQYSVDFDSGSIiRSIUINGKQf\nPMNVFYroSLFICVTPTI^ 

KSSlflRIiIFATrrSGELTYLISDDSGQTWKKSSASIPFKNATl^QMTO 

raiQQTSYGTQVSAXKYSQLIDGIOESAVILSTPNSRSGRKGGQIiVVGLVHKE^ 

GVIiFEKYDSWSRNEIiHIiSNWQYXDLEXMDLTK 

SeqID 213

MimSVQERKCRYSXIUa^SVGAVSMXVGftVVFGTSPVI^ 

KDKQEEKXPRDYYAIU3£iENVETVXBKEDVETN2kSIK3QRVDXiSSEIiDK^ 

YFTMAVYinmkTLEGRGSIXSKQFyNNYinSAPLICVl^ 

THVQXGATKRANNTVWGSNLQXSNLTVYimALTPEEVQEJS^ 

XPAIiLKTDK6TLXAGM>EREUjHSSDWGDXGMVXRRSEDmKTWGDRVTXTNI^^ 

SIYDMPPEGKGXFGMSSQKBEAYKKIlXSKTYQXIiYRBGEKGAYTXRBNGTVYTPTOKATDYRVV^ 

IiIiGNXYFTTHKTSPFRXiUa)SYLPgMSYSDDIX3KTWSAPQDXTPf4^^ 

TWEKDXKRYPQVKDVYVQMSAXHTMHBGKEYXXLSmGGPKIlENGMV^^ 
YGXLYEHTEKGQNAYTIiSFRKFNWBFIiSKlQLXSPTBANRDGQI^ 

VCSRGRYRTGNYWYSRKHRKYi^SGKSSRCQSSWRSKWQSSGASSRXYRGSWYRASCSI^ 
AGTSNRKQGEPPSFTRTNSFLPWSVYARBKERT 

SeqID 214

MXQXGKXFAGRYRXVKQX6RGGMM)VyiiAia3LXIiIX3BEVAV!^^ 

GQQYIiAMEYVAGIiDIiKRYXKEHYPLSNEEAVRXMGQXIiIiAM^^ 

TQTNSMIiGSVHYI.SPBQARGSKATVQSDXYJ^XXFYEMIiT6HXPYI^DSAVTXJ^ 

ATAKKLTNRYRSVSEMYVDI<SSSLSYKRR19BSKLXFDETSKADTKTI.PI^ 

PQAPKKHRFKMRYLXIiIiASIiVIiVAASLXWXIiSRTPATXiUIPDVAGQT^^ 

GAGTGRKEGTKXNLWSSGKQSPQXSNYVGRKSSDVXABIJCEKKVPI^ 

XVLTVAKKATTXQIi(aNYXGRNSTEVXSBIiKQKKVPBmiXKXEBEESSBSBPGT 

VftMPSYXGSSIiEPTKNNIiXQIVGIKEAOTEVVEVTTAPAGSAEGMVVEQSPRAGBI^ 

SeqID 215

MTKLXFHGTPDFSATVLKGIiLTDDRYBXIJWVTQPDIUVVGRKKVXQB^ 
XVTAAFGQFIiPSKLLDSMDFAVNVHASLLPRHRGGAPXHYALXQGDBEAGVTXMBMV^ 
BKIJUiiVGRDLIJ^TLPAyUlGDIKPEPQDTSQOT 
PVBGQGNPGEXLSIGKKELXVATAEGALSIJCQV^PAGKPKMDXASFM 

SeqID 216

VFRRIiGQDFQIiRICVKOIiKQXNMiKGKMSSLSDQELVART^ 

AXVMHYGNVABMNTGEGECTLTATMPVYIiKZkFSGEGVNVVTFN^^ 

XYASDXXYTTNSNLGFDYLMDMXiASNBEGKFLRPFNY^ 

YXFKBBKBBVWIiTTKGAKSABNFIiGXDOTliYKBBHASFARHIiVYAXRA 

IjHQAIBAKBHVKIjSPETRAMASXTyQSIiFKMFNKXSCTmSTGKVAEKEFXEl^^ 

YASLEYXKQYHAKGNPLLVFVGSVEMSQLYSSIiLFREGXAHNVIi!^^ 

VAEIiGGLXVXGTERMESQPRXDLQIRGRSGRQGDPGMSKFPVSLEDDVIKKTOPS WHKiCYK^ 
EKAQHASDSAQRSARRQTIiEYAESMNXQRDXVYKBRiraLXTCSRDIiBDVV^ 
VKBVPDyXDVTDKTAVRSFMKQVIDKELSEKKEI«I.NQHDLYEQPI^ 
EYYQEAYAGFEAMKBQXHADMVRNIiIiMGLVEVTPKGEXVTHFP 

SeqID 217

MTETVEDKVSHSXTGIiDIIiiraXVAAfiAVXSGTVATOTKVFTNES^ 

SASTSASESASTSASTSASTSASESASTSASTSXSASSTVVGSQTAAATEATAKICVEEDRI^ 

KRSVDSlBQLIiASIKNAAWSGNTXVNQAPAXNASIiNXAKSETKV^ 

KTNDIiGraSSMRPGYSXYNSGTSTQTMLTLGSDliGKPSGVKNYXTDiTO 
YGLTSSWnTVPIT<OT)TSPTFTPYAARTDRIGINmTGGGKVTO

SASTSASASASTSASVSASTSASASASTSASASMTSASBSASTSASASASTSASASMTSASASASTSM 

TSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESMTSASBSASTSTSA^ 

SBSASTSASASMTSASASASTSASASMTSASASTSASESMTSASASASTSASASASTSASASASTSASASAST^ 

TSASASASTSMASASTSASBSASTSASASASTSASASASTSASASASTSASASASTSASASMTSASESASTSAS^ 

SASASTSASASASTSASASASTSASASASISASBSASTSASASASTSASASASTSASASASTSASBSASTSASASASTSASAS 

ASTSASASASTSASASASTSASASASTSASASASTSASBSASTSASASASTSASBSASTSASASASTSASASASTSASASASl 

SASASASTSASASASTSASASASTSASASTSASBSASTSASASASTSASASASTSASASASTSASESASTSASA 

ASTSASASASTSASASASTSASASASISASBSASTSASASASTSASVSASTSASASASTSASESASTSASASASTSASESAS1 

SASASASTSASASASISASBSASTSASASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASASASTSM 

ASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASl 

SASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASESASTSASASASTSAS 

ASASTSASBSASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAM 

STSASASASTSASASASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASVSAST^ 

ASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASASASISASBSASTSASASASTSASASASTSM 

SASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASAfiASTSASASASTSASA^ 

TSASVSASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASASASTS/ 

SASASTSASASASTSASASASASTSASASASTSASASASTSASASAfilSASESASTSASESASTSTSASASTSASBSASTSAS 

ASASTSASASASTSASASASTSASASTSASBSASTSASASASTSASASASTSASASASTSASASASTSASASASTSASVSASl 

SASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSAS 

ASASTSASASASTSASASASTSASASASISASBSASTSASASASTSASASASTSASASASTSASESASTSASASASTSA^^ 

STSASASASTSASASASTSASASASTSASASASTSASBSASTSASASASTSASESASTaASASASTSlWSASASTSAS 

ASASASTSASASASTSASASASTSASASTSASBSASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASi 

STSASASASTSASASASTSASASASISASESASTSASASASTSASVSASTSASASASTSASESASTSASASASTSASESAST5 

ASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASBSASTSTSASASTSASESASTSASASASTSASi 

SASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTi 

ASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASVSASTSM 

SASTSASASASTSASASASTSASASASTSASBSASTSASASASTSASASASTSASBSASTSASASASTSASASASTSAS^ 

TSASASASASTSASASASTSASASASTSASASASISASESASTSASASASASTSASASASTSASASASTSASASASISASES2 

STSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSAi 

ASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASTSASESASTSASASASTSASASAS 

SASASASTSASBSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASAS 

ESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASBSASTSTSABASTSASES2 

STSASASASTSASASASTSASASASTSASASTSASBSASTSASASASTSASASASTSASASASTSASASASTSASVSASTSAi 

ASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSM 

STSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASBSASTSASASASTSASASAS 

ASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASASTSASASASTSASASAS 

SASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASBSASTSASASASTSASASM 

ASASASTSASASASTSASASASlSASBSASTSASASASTSASVSASTSASASASTSASESASTSASASASTSASESASTSASi 

SASTSASASASISASBSASTSASASASTSASASASTSASASASTSASESASTSTSASASTSASBSASTSASASASTSASASA.* 

TSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASi 

SASTSASASASTSASBSASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASESASTSASASASTSASAS^ 

TSASBSASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASA£U!^TSASASA^ 

SASASTSASAJ5ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASVSASTS 

ASTSASASASISASBSASTSASASASTSASASASTSASASASTSASASASXSASESASTSASASASTSASASASTSASASAS^ 

SASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASISASBSASTSA2 

ASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASAS^ 

SASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASBSASTSA^ 

ASASTSASASASTSASASASTSASASASTSASVSASTSASBSASTSASASASTSASASASTSASBSASTSASASASTSASBSi 

STSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTi 

ASASASTSASASASTSASASASISASBSASTSASASASTSASASASTSASVSASTSASASASTSASASASISASBSASTSASi 

SASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASA^ 

TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS2^SASTSASASASTSVSNSAN^ 

STGKSQKEIiPNTGTBSSXGSVIjLGVIiAAVTGXGIiVAKRRKRDE^ 

SeqID 218 

MSNBKNTimn^KKDATVVAHEIKGELTYEDKVTQKIIGLSLENVSGI^ 

VDU!mVEYQKim>ALYSElREIVSSBVAKMTDLEXVEINVNVTO 

GDGSGFSTVQEKVSEGVEAVKGAANGWSHENTRVN 

SeqID 219 

^TI^CEKNVILTARD^VVEFDVM5KVLTAIRGVSIiELVBGEVIlALVGE 

SSHKDWEQIRGAKIATIFQDP^^•SLDPIKTIGSQITEVIVKHQGKTAKEAKEIIAIDYMNKV^ 
RIVIAXMiACTPDVIilCDEPTTAIjDWIQAQIIDIjIiKSIjQNEYHFTT^ 

DPRHPYTWSIiLSSIiPQIiADDKGDLYSIPGTPPSLYTDIiKGDAFAIiRSDYAMQXDBWKAPQFSVSET^ 
KPAVXANIjHDKXREKMGFAHIiAD 

SeqID 220

MKIIiniVFATAQLVLIiAAGVIiAACSSSECSSDSSAPI^^

EDWEVSKDGLTYTYKIIUCGVKSTFTSDGEBTAEVTi^FVN^^ 

DYTLQYTIiNQPEPFMNSKLTYSlFWPIjNEEFETSKGSDFAKPTDFTSIiLYllGPF^^ 

INIAYYTOSDQESIiSRNFTSGAYSYARI«yPTSSNYSK\rftBEyKDNIYyTQS6S6XA6Irf^^ 

Alil^inaDFRQAIiNFAIiDRSAYSAQIlKSKDGAAIiAVR^^ 

AKABFiOCAiaCAZjiSaDGVQFPIHIiDVPVDQASKNyXSR^ 

SGGVSWGPDYQDPSTYIiDIIiKTTSSETTKTYIjGFDNPNSPSWQVGLI^^ 

FIPiU4ASSGAAFVLSRIVPFTGASAQT6SK6SDVYFKyiiKSQDKV^ 

SeqID 221 

MBINVSKWTDLPQVGVQPYRQVHMSTGNPHSTVQNEADYHN^^ 
AAVELIBSHSTKEBFMTDYRLYXEIiLRNIiADEAGLPKTZJ^ 
DIENGIiTXEl^QKNDTGYnYVHSDGSYPKSKFBKINGTWYYFDSSGYMI^ 
YFNEEG2^MKTGWVKYiayrWYYU>AKEGAMVSNI^IQSi^ 

SeqID 222

MKKKYWTIiAXLFFCLFNNSWAQEXPKNItDGNXTinrOT 
KNCCKI^LEPQXNMDXVNSESNNIiIX?ED29IiDNKXKB^ 

NSKVSXAXLDSGVDIiQimSIiLKNLSNHSKNYVPNKGYLGKESGEBGXXSDXQDI^ 
lUFGKSSASPDWXVKAXFDAVDDGNDXXin^STGQYIiMXDGEYETO 
SDIiIiIOiiXSSRKKVRKPGLVVDVPSYFSSTXSVGGXDRIiSNLSDFSl^ 
SATLGGYTYLYGNSFAAPK\^GAXAMXXDKYKLiaX2PYNYMFVKKFNB^ 

SeqID 223

MKKDELFBGFYLXKSADIJlQTRAGKimiAFTFQDDSGEXDGKLWI^ 
QAGEPNDPADFKVKSPVDVKBXIO^YMSC^FKXENPWQRXVRimYTKYD^ 
XSEVYPQLNKSIJLYA6XMIlHDIIAK^^EI•TGPDQTEYTV^ 
EYGSPVRPRXMEAEXXHMXDmJ>ASMM^SMSTA£laLVDKGEMT^^ 

SeqID 224

VTXI^KDTVQQSAKGESVTQEATPEYIO^EOTPGGDKG^mSSSDANA^^ 
NAXEKAAKDKQDEXKGAPLSDKEKAEIlIiARVBAEKQAALKEXENAKT^^ 
PQATAGTMQDVTYQSPAGKQLPinrGSASSAAXiASIiGLVVATSGFAIiIXSRKTIl^^ 

SeqID 225

MNADDTVTXYDVARBAGVSMATVSRVVNG»IKNVKENTRK^^ 

KGXDDXAEHYKYNXVIANSDEDNEKEVSVVimiFSKQVDGXXYmYHLTDKXRSEF 

QATXDAVSYXAKENERXAFVSGPIiVDDXNGKViaiVGYiaEnrLiaU^ 

BIiAAGVLNGLADKGVSVPEDFBXITSDDSQXSRFTRPNLTTXAQPIiYDIiQAISMR^a^ 

RKRK 

SeqID 226

MKKIOjVFPNIiFnWGAASSGPQTEGQYGKVHENVHDYWFKTHPEDFro 

RLXKiniiBTGBPDPKGXAFYNAXXEBAKKNQMDLVMNIiHH^ 

mPMVXPEAOYIjYAEHYPMIiKGRGKBAVQVIYNLNLASAKVXQL^^ 

NKVFIiNPAVKGTFPERXiVKQLERDGVXiWSHTEKEIiQLMKSMTVD 

RMNPYRGWEXFPKAXYDXAMXVKEEYGIiXPINrFXSENCaaGVlSNEARFXDENGV^ 

AWTAFDCWSWNNAYKNRYGFXSVDIiBTQK3lTXKSS6RWYRKVSDNNGFEVBIBB 

SeqID 227

VBNLTNFYEKYKVYLTRPRIiBIiIiAVVTXVFCAVLVFF 

NGAFNGKGTFQSKEGWTYEGDFVNGQABGKGKLTTEQSWyEGTFKQGVFQQK 

SeqID 228

MIiinCXRDYIiDFAGIiQYRNPDKAGAEREKMLAFRHKX3QEARIG^ 

GaVTBPMMAIJU.YGTSTDPGISLEVSPIBRKKDBQTIiGKQAKVXiDXPTVKGXY^ 

VRKVIiVKVDVPMTENSSBBBXVEGIiLKSYSKILPYYIiATRK 

SeqID 229

MVQNSCWQSKSHKVKAFTiaJBSLIiALXVISGGIiIiLFQAMS^ 

MKQDGSDXAXGKSKSDDFRKTNARGRGYQPMVYGIiKSVRXTEDNQLVRFHFQFQKGItEREF 

SeqID 230

MKKMOTFIiKKAKVKAFTLVEMIiVVIiLXXSVLFLIiFVPNLTKQKEA 
XTEEQAKAYKEYI9DKNGGANRKVND 



SeqID 231

M^SKVRKAVIPiU^GLGTRFLPATKAIJ^CE^^
KTDI^IjKLVDKTTDMRIiHFIRQTHPRGLGnAV^ 

PHDEVSAYGVIAPQGEGKDGIiYSVBTFTOKPAPEnAPSDIAIIGRYIiLTPBIPEII^ 
iO^FKGARYDVGDKFGFMKTSXDYiUiiKHPQVKDDLKKY^ 

SeqID 232

MQNQIJCraLKRKMLEFFQQKQKNKKSARPGKKGSSTKKSKTLDKST^ 

FDKVRVPQTEELVNQVroiSSISEITYSDGT\aASIESDIjLRTSISSEQISENIiKl^ 

KFVGIiGSSSGGSTLTQQIiIKQQVVGDAPTIiARKAAErVDAIi^ 

VDASQIiTVPQAJOTjAGLPQSPITYSPYENTGEIJCSDEDIiEIGIiRRJ^^ 

GXSRDYIiYFTTIJ^BAQERMyDYIiAQRDNVSAianiKNEATO 

GTGRVEV(SrVIiMDNQTG2U:LGEVGGRNYQENQE3N^^ 

ANSKGTGMMTLGBALNYSWNIPAYWTYRraJffiKGVDVKGTOB™ 

QKHVISKIEAAIXSRVVYEyQDKPVQVYSKATATlMQGIiLREVIiSSRVTTTO 

ItSTPRIiTI^GWIGBDDiraSIiSRRAGYSNNSNyMAHIjVNAXQQASPSIWGNERFAIiDP^ 

VTGSTVTSYWANKSGAPATSYRFAIGGSDADYQNANSSIVGSLPTPSSSSS^ 

SeqID 233 

MSSKF^aCSAAVLGTATIlASIJiLVACGSKTADKPADSGSSEVI^ 

KLSLDNQSGNVPDVMMAPYDRVGSIiGSD6QIiSEVKI>SIX3aiC^ 

ADIjENLAKDSKYAFAGEDGKTTAFIiADWTNFYYTYGIiIiAGN^ 

TBGAONLlQTQFQEGKTAAIlIXSPWKAQAFia^AKVinrG^ 

QQICVIiYDKTMEIPAllTEARSYABGKin)BLTTAVIECQFKMT^ 

TIKQKFGE 

SeqID 234

HXDKVVIQTLIXTFFFCSimKXIXFLTTlLT^^ 

RKEKGHA]S^RIjIiQmi:GKEPXXTGVYXGPDNWEVVDVDEEWVKI^ 

SeqID 235

MXIiSKNREDGIiRKFATNXRIJmiRTIiNHI^ 

LircFFDKBFIiYSIJmTGTKI>PSHPDRNI.TPGXDimX3SXi6Q6XSV^^ 

SHQQLSNLXVFVDDinCKQLDGFTia>XC3IPGDFVEKFSAFGFESXRWGSDXREXYEG 

EIiBEMKSNHHIaEUPTVEEKQMLTSWERLSQEIiEETE 

SeqID 236

MiaCTTXLSliTTAAVIIiAAYVPiraPXIiADTPSSBVIKBT^ 

TXPTKXTVGDKVFTVTEVASQZ^SYYPDETGRIVYYPSSITIPSSXKKIQKKGPHGSI^^ 

EEXELPASI^XGTSAFSFSQKIiKKLTFSSSSKLEIjXSHEAFAMLSiniEKLTLFKSW 

ASVDGVIiPSKDKTQLIYYPSQKNDESYKTPKETKEIASYSFKKKSYliKKIiEU^^ 

RZiAFYCailiELKELXLPDNVKNFGKHVMNGLPKIiKSIiTXGMNXNSL 

FYVTSEHXKDVLKSMLSTSIO^XXVEKVDNXKQETDVAKPKKNSNQGVVGW 

GSMATGWVKDKGLWYYIiNESGSMATGWVKDKGXinYYZiNnSSGSS^ 

ATGWVTVSGKWYYTYNSGDLLVNTTTPDGYRVMANGEWVG 

SeqID 237

MVRFTGLSLKQTQAXEVIJECGHXSLPDVEVAWQSDQASXSXEGEEGHYQLT^ 

EDIiAYMVDCSRmVIiNVASAKQMXEXLALMGYSTFBLYHEDTYQIEG^ 

IJUiliSAFVKWGVKEVQEXiRDVEDXIiLXGEEKVYDIiXDGMFATLSK^ 

ERVIa^XADKYGFHCQ^mSDMFFKIJ^aSADGQYDRDVEXPEETRVYIlDRl^ 

AGGAWKWIGPTPH!raFSRIjVAIBANKACRANQIKBVr\m3WGDNGGKTAQPSILPSIiQXW2^ 

VEDFHQXDIANZiIiPDXiPGmiSGXNPNRYVFyQDlLCPXIiDQHMTPEQDKPHFAQAAET^ 

SSECTOVGRRXIlQAYQADDKBSIiQQXARQEIiPEIiRSQXBDFHALFSHQmiKEl^^ 

GQXiDRXDEIiEVEXLPFTDFYADKDFAATTANQWHTXATASTXYTT 

SeqID 238

MSNSFVKIJiVSQLFANIJU3XFFRVTXXANXYXXSKSVXATSI>VPXLXGXSSFVASLLVPLW 

XIiVGHFTVHQSVAPIiVTYIiFVVAXSXIiDGFAAPVSYAXVPRYATDLGKANSA^ 

NIiVLYXXSSFIiMIiFIiPIIAEVBVLESBTmiEILIiKGWKLVARl^ 

YSinrAYSXGXXXSGIiXAFRIiSEKFIJUaCWEPQLFTPNIiKTXQNPCIiSIjDPGWFIjFS 

THMNSlJ»NHHFQNKSFYQIiSPIX3GHLTQYGGLIFFQEIiFSQIiKLKERIS 

YACKEIiSADAYFPKIiIiEGGQIiASQPTLSRPLSRTDBETVHSIiRC^^ 

AHYRAmYHPLYAFEGKTGYCFNAQIAPGNRYCSEBADSFXTPVLERFNQI^ 

VIiSRLGDIiSLPCPQDEDIiTXIiPHSAYSETLYQAGSWSHKRRVCQFSERKEGBnjFYDVXSLVl^^ 

NFXKEMKEGFFGDKTDSSTIlXKJ!VEVR^I^!^^CXAYl^lYIlFI^^ 

YSELFSAIlYSRXRK^mLNLFVPYEPPRRKASL^!HH 



SeqID 239

MFASKSERKVHgSIiaCFSVgVaSVVVASI.VMGSVVHATEN^

KIVGESYAKSTKKRHTITVMiVNBIJaNIKNB^ 

TAKPNKPTBPGBKVAEAKiaCVEKAKKKI^QK^ 

AEVESKQABATiaiKKIKTDHEEAEEEAKRWU>AKEQGKPKGI^ 

VASAEKSCVBEAKKKABDQKEEDRBNYFTNT^ 

BKIKTDMCKABEBAKRroUkEEDKVKEKPAEQPQPAPAPKAECT 

QQPPKTBKPMPSTPKTGWKQENGMWyFYNTDGSMATGSffM 

GSWrniN™GSMATGWIjQYl!K3SWra-NAiraSMATGWI^ 

WYYVNGSGAIiAVin^TVIXSYGVXIANGEWVN 

SeqID 240 

MNYSKALNECIESAYMVMHPGa^YliESTOaiLIAM 

IjQVLPDEAEYVMVVffiUOTiGTEHVIiYAIIjHDSN^^ 

TVADKQNSMANMMGMPQTPSGGLEDYTHDLTEQARSGKI^PVIGro 

RIASGDVPi^JEMAKMRVIiEIiDIjMNVVAGTRFRGDFEER^ 

RGTIiRTVGATTQEEYQKHIBmAAIiSRRPiUCVTIEEPSYaDSMriLQ^^ 

DSAIDLZiDEAAATVQNKAKHVKADDSDLSPADKAIiMDGKWKC^ 

DiUCKYLNIiBAEIiHKRVXGQDQAVSSISRAlRSNQSGIRSHK^ 

^!EKFAASRI*lK3APPGYVGyEEGGELTKKraNKPYSVIi 

GATALRDDKTVGPGATOIRPDQENMEKRMPEEIiKKAYR^ 

QASiUja«IANQGYDPEMGJ^RPIilU5^TIiQT^^ 

SeqID 241

MKILPFIMGTSYYIiKMSVKKLVPPLVVOIiPlIA^ 

SNVNGFEIPAAYGNANBWGHRARRBGYRVDNTPTIGSITWSTAGTyGHVAWVSNVMC^ 

TGFIHFrorJXSGSVGNSQSSTSTGGTHYFKTKSAIKTEPIASGTV^ 

EAVIlKNPLGNSVIiSSTGGTHyPKTKSAIKTEPbVSATV^ 

NyQNQSGNISSYOSHSSSTTOWKKINOSWYHFKSNSSKSTGWLKDGSSWYYIiKIiSGBMff 
QVSGKWYYSYSSGAIJVVNTTVDGYRVNSDGERV 

SeqID 242

^9KVIFIJ^>VKGKGKRGEIKEVPTGYAQNFLlKKNIl^^ 
KVGPTCRTFGSITNKKXAEEIjQKQFGIKIDKRHXQVQZ^IRAVGLIDVPVI^ 

SeqID 243

MKKKXZJkSItliliSTVMVSQVAVLTTAKMSTTDDK^^ 

Kia^GEITELSKNZVSRNQSIiEKQARSAQTNGAWSYlMTXVNSKSlTE2^SRV^ 
AMNDAXNTVXAXrOQIOiADBAQAIiTTKQAEI^KAAEI^ 

IJ^AimiljTAQVQAVSESAAAPVRAKVRPTYSTEQ^SYPXGECTWGVKT^ 
lACanaDGGYGmTAVVTAVESTTRXQVSESNYAGimTXGMH^ 

SeqID 244

MVKRRXRRGTREPEKVVVPEQSSXPSYPVSVTSNQGTDVAVEPAKAVAPTTDVnCQEl^ 
NSlSGSMKVNQWFQVGGKWYYVin'SGBLAVOTSXDGYRVl^^ 

SeqID 245

BIlRRLSRIiVDQELYFGCGVnalSIlEWLPSMRKDSHPSNTAPRTT^^ 

SeqID 246

DCXRKQPFTRDEPNKTCRKTKPSKSYCSYRW 

SeqID 247

GQRNPRRXERVXRMAETKPRXSKREG 

SeqID 248

QRIOIiFKXFHLFQKKSGWNQKSSCIjKIiNIiNSIiNRK^^ 

SeqID 249

LVXXVXiKIQSKSETDFXFKTWPFXLLSKXXPItMVXiDCQVSXSWTNREa^ 
SWYPHDNADSTRXMAFSRKGC 

SeqID 250

ERLPAFPRSZiSGRICIjDQGGTKEKGSDGRSP 

SeqID 251

RNCLSTWKSSSNYHTEXiCRGTVRQCMICGRPKEVYSADYAQQSYEIi^^ 

VNVGXSTrSTYWIHHGRIiGLSKQDIiLYPRKGKAIiKKQASTMFKPAGQSXEQRPBA^^ 

DRKSRHQXXRLXPNKSASVVNQAIiKLXIiKQHniiSXTADNGTEFNRIiFDXFSEE^ 
SeqID 252

PVMTXSSPTMKNMDIjSTKASPSQPI>Q6KH6MXWSGK 
SeqID 253 

TSSXRlHTRKSSPNWTTTPHLAIiSAETN 
SeqID 254 

YFIiFIOnrARESLSLPSTNKXIiHRKQGS 



SeqID 255 

AAPKKDQINERVEKLGKLKPITINraGKSEVIDSKEKLQBIiMN^ 

SeqID 256 

AYAHSKRSA6SGRAG6RQCLCQCQNKCRRDFKY 
SeqID 257 

HGRPYHKPHQPHHHGFPQQSYNLLPPKHKPTLCVRR 
SeqID 258



KGKIIiLJJPRLTTQRWQRKIlOTSRKSJ^NNIU^^ 
SPKEDDXiXHDD2VILVRFGIZiEVin3SPYEIiIjIiLYHTHSYRFSCSIYI>S 

SeqID 259 

FWSHVFLLYLSFm>RPKSMSMPTSSKIiIiRPRFRTPIISASDFSVKSCrn^ 
ASFITSVDIAISVRFVNKSRCSVKIRAESPKASSGIIVPFVKISKVOTiSKPSIiL 

SeqID 260 

IiVCMKNKGCYKERNNCCH 

SeqID 261 

FHYLSKYFLVSAITTGDKTKRAIKFGRAIKALTISAI IQTISNSIOTPNKTMSTKTTR 

SeqID 262 

GKKVPIKYPIiSRVSSKTGPraTGRTKDKIVDKKVGCPPBKSTVKySS 
SeqID 263 

SSPVFPKXiVMVSGANKPRSEaOTFPFSSKMSFHIiTFVIi 
SeqID 264 

yLTSPSVPKIASSKVia.TR™RSSPWRGAPGIiREBPPPPKKIiEKISSKPPKPPM 
SeqID 265 

QSWRPIPDSKCYTQBKLTIPIKRRKDIKDFYHNSIQRHKNSHKSHIJiiDSyRLI 
YBVNRV 

SeqID 266 

IiFRFYRVIVLYR6Ninrnj:4lLVNLQyVQNVFRKDRFLVRGJ!^^ 
SeqID 267 

ITHPPIiNPEHFVSRVFSSI/SLKSYQPKDDRFIiRKPSDSRHPESGNSGKMQVIiNS 
SeqID 268 

TLAKAVGI<HYSPPIPPKPFIiGRITTDSSSI 
SeqID 269 

PGSPFSBISQAGFFGV3UCRIPPRPPRPPWATINSCPCSIKSVKTLPVSASRTVVPCG 
SeqID 270 

INSLTIATSIiSKRRAPRKASKASPRMVSRIiRPPDFSSPIiPmilNWSim 
BINNCRTASPR3VSKRSLCEIFKRRCSIA 

SeqID 271 

PIGKRNC3CABCK2SHiniIiEKQKTFC2SRK3^YGASPEPRYRESI^ 
SeqID 272 

QPLGHSKABEHETICSHTFDinn^TBTIPNQVKGRDMTSSETLPFPSKNQNC^ 
SeqID 273

PCSIiPDYGLVGSGYHSCaiYQSNDTRFIiBSHGNWRmjYSWSWlLCQE 
TFFFTHXDKVLVQAHIISQFSmKRTYQHIFETiGCNMLrVHCC 

SeqID 274 

RVKDmiliDKLVKAIiItKimSSTQYYIHQIJiRK^ 
SeqID 275 

QVIiaDlATTinCTESVKSQSBREKKRIiTSSNXZjKVRGRPX 
SeqID 276

AFKSSKVPSLDPSSTKTYSZSVSKSGSKASRA 
SeqID 277

DK^DPIARKI«PDKSKPSTSFCTKSIlSFVN^SASIlTa2^CPSKTI^ 
SeqID 278

LCaOiQTQARPRGSVTinjTKQNKVTRyiiN^ 
SeqID 279

CXQSI6^rEGQCK(aTSCYVGKBIHLAPXSDIV6HKGKEEGDDGIn^DGRQFyIlFI^^ 
GRIiXSKDXXNSTGQNGXllNTKVXSPFA 

SeqID 280

TSTKLVIDTTTFMrFCTNm!KSSKFTOT!PT^^ 
TSFiniYCTNQDRIjTIiLVTSI^BmrDNRFiaj^^ 

SeqID 281

XSTGVPTCSMYPIiIiKTAIIiSDKVKASS 
SeqID 282

LSTMSSXKSXLCSIiPTHTXFSKSAXSPKRGSTAX 
SeqID 283

SCSYSSTNSKACSXVGLSKPTXFXCVTPXSAAK^^XSSASRPi^SSTFKCVCVSKXXSSNLV^ 
NREIiFCYIiFYPXIiPXSYSTVRDRRDWIffl 

SeqID 284

WSCXVPAVACGAIiVVIjGAAU3ATGLIX3TVT^ 
SCLSIiAAFSMAFIiADSFSVANCLEACD 

SeqID 285

YSPFIOISXLXRKTTKXXNPNPIG^RMNWRSICVWSNQPVNXST^ 
SeqID 286

DYPKFRTTPTRFSTVKPYSANTPGAGABAPKVSIPRTAPSRPTYLYQF 
SeqID 287

RGRRGLCVARXKAPRLVXKPKRTXDPPTKDRYSPPLSATSLSVPIOSPXXSFPAKMDK^ 
SeqID 288

TPPYTKXPAKTAXXPFXSAQDFNQAQRLSG 



SeqID 289

ATGAAGAAAAAAATCTOAGOGTO^CTTTTATTAAGTACAGTAATGGT^ 

AACGACTGATGACAAAATTGCTGCTCAAGATAATAA 

ACCAAATTCAGGAGCaUVGTATCAGCTATTCAAGCTGAGCAGTCTAAC^ 

AAGAAACTCGAGGGTGAGATTACAGAACTTTCTAAAAACATTGT^ 

TCAAACyU^TGGAGCOSTAACTAGCTATATCAATACCATTGTAAACTC^ 

a^TGAGTGAAATCGTATCTGCAAACAACAAAATGlTAGAACAACAAAATC 

GCAAATAATGATGCTATCAATACTGTAATTGCTAATCAACAAAAATTGG 

AGAACTAA?^GCT6CTQAATTAAGTCTTGCTGCTGAGAAAGC^ 

GAGCy^GCnxSAGGCAGAGGCTTOTGCAGCTGC 

CTTGCTTCAGCAAACACTAACn'AACAGCTCAAGTGCAAGC^ 

TCCAACATACyiGTAC^AACXJCTTCMGTTATCCAATTGGAGAATC 

ACTACTGGGGTAATGGAGOlCa^TGGGCTACSAGTGCAGC^ 

ATTGCATCTTGGAATGATGGTGGATATGGTCACOTAGCGGTTGTTACAGCT^^ 
ATC2UUlTTATGa«3GTAATOGTAaU^TTGGAAATC3lC
ATATTTATGCAGAT 

SeqID 290

ATGAAGAAAAAAATCOTASOOTCACrTTTAT^^ 

AAOGACTGATGAOUWUlTTGCTGCTCAAGATAATAAAAra 

ACCAAATTCAGGAGCaUiGTATCMCTATTCAAGCTGMCAOTC^^ 

AAGAAACTOSLAGGGTGAGATTACAGAACTTTCTAAAAAC^ 

TCaAACAAATGGAGCCGTAACTAGCTATATOkATACaiTTGTi^ 

CAATGAGTGAAATCGTATCTGOM^CAACAAAATGTTAGiUVCAACA^^ 

GCAAAyAATGATGCTATCaATACTGTAATTSCPAATCAACAAAi^ 

AGAACTAAAAGCTCKrimATTAAGTCTTGCIGCT^ 

CAGCAOCTGAGGCAGAGGCIXXSTGCAGCl^^ 

CTTGCTTCMCSUUICACTAACTTAACAGCTCAASTGCA^ 

TCCAACATACAGTACAAAC^CTTCAAGTTATCCAATTGGAGA^ 

ACTACTGGGGTAATGGAGCACAGTGGGCTAOUlGTGCa^ 

ATTGCATGTTGGAATGATGGTGOATATGGTCACOTAGCGG^^ 

ATO^AATTATGCAGGTAATOOTAa^TTGQAAATaiCCGTGG^ 

ATATTTATGCAGAT 

SeqID 291 

ATGATCX:!AAATCGGC!AAGATTTTTGCCGGAC6C^ 

AGCCAAAGACTTAATCTTAGATGGGGAAGAAGTGGGAGTGAAGGT^ 

CTOSTTTTCa^CGTGAAGCXaAGAGCTATGGCAGATCnaiG^ 

ggtcaagagtacctagctatggagtatgtggctggactggacctca;^ 

agaagcagccratatcatgggacaaattctct^ 

ctcaaaatatcctcttgacaccagatgggacraccaaggl^ 

acccagactaactcx3at6ttgggcrcagttcattacttgtcac 

tatctatgccatggggattattttctatgagatgctgacaggccatatc<^^ 

tccagcatttccagaaacccctgcogtcogttattgcagaaaatccatc^ 

GCUy^CTGCTAAAAAGTOXlACCAATCGCTACaKnrC^ 

TAGAAATGAAAGTAAGTTAATCTTTGATGAAAOGAGCAAGGa^ 

GATCTATTCCTAAGGTTCAAGOSCAAACAGAACACAAATC^ 

CCACAAGCACGGAAAAAACATAGATTTAAGATGOGl^ACCri^ 

TTGGATACTATCKZAGAACTCCTGCAACaiTTGCCATTCCAGATGTGGCAGGT^ 

AAAAAGCCAATTTTGAGATTGGTGAGGAGAAGACAGAGGCTAGTG^ 

GGOSCnXSGAACTGGTCGAAAAGAAGGMCGAAAATCAATT^^ 

TGTCiGGTCGGAAATCCTCTGATGTaiTTGCGGAATTAAAAGAGAAAAAAG 

AGTCGAATGAGAGTGAGGCTGGAACGGTCCTGAAGCAAAGTCr^ 

ATTGTCTTGACAGTAGCTAAAAAAGTTACAAGTGTTGCCATGCCGM 

TTTGATTCAAATTGTTGGGATTAAGGAAGCTAATATAGAAGT^ 

TGGTTGTTGAACAAAGTCCTAGAGCAGOTGAAAAGGTAGACC^^ 

ACAACTTCAGCTACTCCT 

SeqID 292

ATGATCCAAATOSGCAAGATTTTTGCCGGACGCTATCGG^ 

AGCCSUU^GACTTAATCTTAGATGGGGAAGAAGT^ 

CTCGTTTTCSUSCXSTGAAGCGAGAGCTATGGCAGATCTAGACm 

GGTCaACAGTATCTTGCAATGGAGTATGTTGCTGGAC^^ 

AGAAGGAGTCa^TATCATGGGACAAATTCTCTTGGCTATG^^ 

CTCAAAATATCCTTTTGACACCAGATGGGACGGCCAAGGTCACA^ 

ACCCAGACTAACTCGATGTTGGGCircAGTTCATTAC^^ 

TATCTATGCCATGGGGATTATTTTCTATGAGATGraSAOkGGCa^ 

TCCAGCATTTCCAGAAACCCCTGCCXSTCaSTTATTGCAGAAAATCCATCTG 

GGIU1CT6CTAAAAAGTTGACCAATCGCTACC6CTCGGTTT 

TAGAAATGAAAGTAAGTTAATCTTTGATGAAAOGAGO^^ 

CATCTATTCCTAAGGTTC»AGCGCAGAC!AGAAGIIC^^ 

CO^CAAGCaCCGAAAAAACATAGATTTAAQATGOTTTACCT^ 

TTGGATACTATCXIAGAACTCCTGCAACCATTGCCATTCC^^ 

AAAAA6CCAATTTTOAGATTGGTGAGGAGAAGAGAGAGGCT 

GGOXITGGAACTGGTCGAAAAGAAGGAACGAAAATCAATTTGGTTGTCTC^ 

TGTCGGCCGGAAATCTTCTGATGTTATOSa^SAATTAAAAGAGAAAAAS^ 

AGTCGAATGAGAGTGAGGCTGGAACGGTCCTQA2\GCAAAGTCTACGAG 

ATTGTTTTGACAGTAGCTAAAAAAGCTAOGACGATTCAATTAGG^^ 

ACTCAAGCAGAA6AAG6TTCCTGAGAATTT6ATTAAGATAGA 

AACAAAGTCCAGGTGCCGGAACGACTTATGATGTGAGTAAACCia^CT^ 

GTTGCCATGCCGAGTTAOVTTGGTTCCaGCTTGGAGT^ 
TATAGAAGTTGTAQAAGTGACmaiGCGa^TOCAGGTM
AGGTAGACCTAAATAAGACTAGAGTCAIWSATTTCAATCTAC^^ 

SeqID 293



ATGATCCAAATCGGCUUW3ATTTTTGCCX3^ 

AGCCAAAGACT^AATCTTAGATGGGGAAGftAGTGGCAGT^^ 

CTCGTTTTCAGCGTGAACX3GAGAGCTATGGCAGATCTAGACa^ 

GGTCAACAGTATCTTGCJWlTGGAGTATGTTGCTGCacn^ 

AGAAGCAGTCOGTATCaTGGGAClUU^TCCTCCTAGCaiTGOGTT^^ 

CTCAAAATATCCTTTTOACaCCMATGGGACTGCCaA 

ACCCAGACTAACTOSATGTTGGGCTCAGTTCATTACTTGTTC^ 

TATCTATGCTATGGGGATTATTTTCTATGftGATGTTGA^^ 

TCCAGCATTTCCAGBAACCCCTGCCGTCOTTTATTGCAGI^AAATCCAT^ 

GCAACTGCrAAAAAGTTGACCAATOKTrACCGCT 

TAGAAATGAAAGTAAGTTAATCTTTGATGAAACGAGCMGGCaGATACC^ 

CATCTATTCCTAAGGTT<:a^GCX3(:a.GAaVGAACaC^^ 

CCACAAGCACCGAAAAAACATAGATTTAAGATGCGTTACCTGATT^ 

TTGGATACTATCCAGAACTCCTGCAACCATTGCCATTCCAGATGTGGaW^ 

AAAAAGCCAATTTTGAGATTGGTGAGGAGAAGAOlGAGGCTAGTGAAAAGGa^ 

GGCGCTGGAACTGGTCGAAAAGAAGGAACXSAAAATCAATCTGGTTGTCTCATC^^ 

TGTCGGCCGGAAATCTTCTGATGTTATCGCGGAATTAAAAGAGAAAAAAGTTCCa^ 

AGTCGAATGAGAGTGAGGCTGGAACGGTCCTGAAGCMAGTCTACa^^ 

ATTGTTTTGAGAGTAGCTAAAAAAGCTACGACGATTa^TT^ 

ACTGAAGCAGAAGAAGGTTCCrGAGAATTTGATTAAQATAGAGGAAGAJ^^ 

AACMAGTCCAGGTGCCXSGAACGACTTATGATGTGAGTAAAC 

GTTGCCATGCCGAGTTACATTGGTTCC:a.GCTTGGAGTTTACTAAGAACaATT^ 

TATAGAAGTTGTAGAAGTGACGAOVGCGCCTGCAGGTAGTGCaGZUtfSG 

AGGTAGACCTCa^TAAGACTAGAGTCAAGACTTCAATCTACAAACC^^ 

SeqID 294

ATGATCaVAATOSGCAAGATTTTTGCCGGACGCTATCGGAT^ 

AGCCy^AAGACTTAATCTTAGATGGGGAAGAAGTGGGAGTCAAG^^ 

CTCGTTTTCAGOSTGAAGCGAGAGCrATGOCAGATCTAGACCATCCT^ 

GGTCAAGAGTATCTTGCAATGGAGTATOTTGCTGGACTAQACCTC^^ 

AGAAGCAGTCCGTATCATGGGACAAATTCTCTTGGCTATGCGC^^ 

CTCAAAATATCCTTTTGACaiCCAGATGGGACGGCCAlW^^ 

ACCO^GACTAACTCGATGTTGGGCTCAGTTC^TTAC^^ 

TATCTATGCCaVTGGGGATTATTTTCTATGAGATGTTGACAGGCCaiT^ 

TCCAGCATTTCawaAAACCCCIXKX!GTCa3TTATTGCAGAAA^ 

GCAACTGCTAAAAAGTTGACa\ATCX3CTATCX3CTCXKm?^^ 

TAGAAATGAAAGTAAGTTAATCTTTGATGAAACXSAGCAAGGaiGATACa^ 

CATCTATTCCTAAGGTrCAAGOSCAGACAGGACAGAAATa^TC^ 

CCACAAGCACCGAAAAAAGATAGATTTAAGATGCGTTACCT^TTTTGTO^^ 

TTGGATACTATCCAGAACTCCTOCAACCATTGCCATTCCAGATGTGGC^ 

AAAAAGCCAATTTTGAGATTGGTGAGGAGAAGACAGAGGCTAGTGAAAAGGTGGAAG^ 

GGCGCTGGAACTGGTCGAAAAGAAGGAACGAAAATTAATCTGGTTGTCTCa^TCA 

TGTCGGCOSGAAATCTTCTGATGTTATCGCGGAATTAAAAfiAGAAAAAAGTTCCA^ 

AGTCGAATGAGAGTGAGGCTX3GAAa^TCCTGAAGCAAAGTCTACCA<^^ 

ATTATTTTGACAGTAGCTAAAAAAGCrACX3A<XATTCUlATTAG^^ 

ACTCAAGCAGAAGAAGGTTCCTGAGAATTTGATTAAGATAGAGGAA^^ 

AACaVAAGTCCAGGT6CCX3GAAC!GACOTATGAT6TGAG^^ 

GTTGCCATGCCGAGTTACATTGGTTCCAGCTTOGAGTTT^ 

TATAGAAGTTGTAGAAGTGACXSACAGGGCCmCAGGTAGTGC^^ 

AGGTAGACCTAAATAAGACTAGAGTCAAGATTTCSUITC^^ 

SeqID 295 

ATGATCCAAATCXKSCSU^TTTTTGCCSGGAraCTAT^ 

AGCCAAAGACTTAATCTTAGATGGGGARXSAAGTGGCAGTGAAGGTTCT^ 

CTCGTTTTCAGOGTGAAGCGAGAGCTATGGCAGATCTAGAC(^ 

GGTCAACAGTATCTTQCAATGaAGTATGTTGCTGaACTAGACCTC^^ 

AGAAGC^TCOSTATCATGGGACAAATTCTCTTGGCT 

CTCAAAATATCCITTTGACACCAGATGGGAC^ 

ACCCAGACTAACTCX3ATGTTGGGCTGA6TTCATTACTTO 

TATCTATGCa^TGGGGATTATTTTCTATQAGATGTTGACAGGCC^ 

TCXAGOlTTTCCaGAACCCCCTGCOGTCCGTTATTGaaGAAAAT^^ 

GCAACTGCTAAAAAGTTGACCAATC6CTATCX;CTa36TTTCAm 

TAGAAATGAAAGTAAGTTAATCTTTGATGAAACGAGCAAGGGAGATA^^ 

CATCTATTCCTAAGGTTCAAGCGCAGACAGAACAaUU^TCAATCaAAA^ 
CCACAAGCACCGAAAAAACATAGATTTAAGATGOSTTACCT^

TTGGATACTATCCAGAACTCCTGCAACCATTGCCATTCa^ 

AAAAAGCCAATTTTGAGATTGGTGAGGAGAAGACAGAJt^^ 

CK5CGCTGGAACTGGTCX5AAAAGAAGC3JU^CaAAAATTAAT^^ 

TGTOKSCCGGAAATCTTCTGATGTTATCGCXSGAATTAAA^ 

AGTCGAATGAAAGTGAGGCTCGAACGGTCCTGAAGCAAAGTCTA 

ATTGTTTTGACAGTAGCTAAAAAAGCTAOGACGATTC^^ 

ACTCAAGCAGAAGAAGGTTCCTGAGAATTTGATTftAGATAGAGGAAG 

AACAAAGTCCAGGTGCCGGAACGACTTATGATGTGAGTAAACCTACTCaAATTG 

GTTGCCATGCCGAGTTACATTGGTTCCAGCTTGGAGT^ 

TATAGAAGTTGTAGAAGTGACGACAGCGCCTGCAGGTAGTGTAGAAGQCATOGT^ 
AGGTAGACCTAAATAAGACTAGAGTCAAGATTTCAATCTACAAACCTAAAACAAC^ 

SeqID 296



ATGTTTGCy^TCAAAAAGCXSAAAGAAAAGTAaiTTATTCAATO 

TCTTGTTATGGGAAGTGTGGTTOVTGCGACAGAGAACGAGGGAAGTACCC^^ 

AACATAGGAAAGCrcCTAAACAAGTCGTCGATGAATATATAGAAAAAATGTT^^ 

ACCCAAAATGTCXSCCrraAAACATAAAGTTGAGasa^TTAA^ 

GAAAGATGAGTTGCOSTCAGAAATAAAAGCAAAGTTAGACXSCAGCTTTT^ 

AAAAGGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCOSAGGATC^^ 

CCa^CCAATACOTACAAAACGCTTGAACTTGAZ^TTGCT^ 

AGAGGAAGCTAAAGAATCTCGAAACGAGGGCACAATTAAGCAAGC^^AAAGAGA^ 

GGTTAGAAAACaiTCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAAaSAAAAGm 

GTAGCGACTTCAGATCAAGGTAAACCAAAGGGGCGGGCa^AAACGAGGA 

AAATGATGCGAAGTCTTCAGATTCTAGOSTAGGTGAAGAAACTCrTCaUlGCTC^^ 

AAGCTGAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCO^GGATCAAAAAGAAGAAG^^ 

TACAAAACGCTTGACCTTGAAATTGCI^GTCa3ATGT^ 

GGAACCTOSAGACGAGGAAAAAATTAAGCAAGCAAAAGCnRAAGTT^^ 

TCAAGACaVGAT(^TAAAA7UVGa\GAAGAAGAAGCTAAACGAAAAGCAGC^ 

CAACCACAACCAGCGCCGGCTACTCAACCAGAAAAACCAGCTCa^^ 

AACAGATGATCAACAAQCTGAAGAAGACTATGCTCGTAGATCAGAAGAAGAATATAATCGC^^ 

AAACTGAAAAACO^GCACAACCyVTCTACTCCAAAAACAGGCTGGAAACa^ 

GGTTCAATGGCAACyVGGATGGCTCCMAAa^aSGTTCATGGTACTATCTA;^ 

CCAAAACAATGGTTOlTGGTACTATCTAAACGCTAATGGTTCaU^T^ 

ACCTAAACGCTAATGGTGCTATGGOSACaVGGATGGCTCCAATAa^T^ 

GCGACyVGGATGGCTCCAATACAATGGCTCa^TGGTACTACCTCSUVCXOT 

CXSGTTCATGGTACTACCTCaACGCTAATGCTGATATGGCGACAGGATG^ 

CTAATGGTGATATGGOSACAGGTTGGGTGAAAGATGGAGATACCTGGT^ 

CAATGGTTCyUAGTATCAGATAAATGGTACTATGTCaATGGCn?<a^ 

AGTCAATGCCAATGGTGAATGGGTAAACTAA 

SeqID 297 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTaSTAAATTTAGTGa^ 

TCTTGTTATGGGAAGTGTGGTTCATGCGAOWSAGAAaSAGGGAGCTACCCA^ 

GTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGAGATAAGGCyU^ 

AAAATAGTGGGTGAGAGCTATGCAAAATGAACTAAAAAGCGACATACAATTAC^ 

TAAGAACQAGTATTTGAATAAAATAGTTGAATCAACCTCAGAAAGCCAACT 

TAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCACCTTCTTaSTCAAGTTCA^ 

AGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGCAAAAGCGGAAGTTC^ 

TTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCaVGC^ 

ACCAGCTGAACAACCy^Ca^CCAGCGCCGGCTCCMAAGCAGAAA^ 

AACCAAAAGOlGAAAAACCAGCTGATCAACMGei^^ 

CAACAGGT^CCGCCAAAAACrKSAAAAACCAGCACAACCATCTACTCCA^ 

CTTCTACAATACTGATGGTTCaVATGGCGACT^GGATGGCTCCMAACAAT^ 

TGGCGACAGGATGGCTCCAAAACyU^TGGTTCATGGTACTATCTAAACGCTAATTC 

AATGGTTCATGGTACTACCTAAAC»CTAATGGTTCAATGGa3ACAGGAT^ 

CGCTAATGGTTCAATGGCGAmGGATGGCTCCAATACAATGGCTCATGGTACT 

GTTGGGTGAAAGATGGAGATACCTGGTACTATCn^GAAGCATCAGGTGCTATC 

AAATGGTACTATGTCAATGGCTOVGGTGCCCTTGCAGTCAACACIAACTO 

6GTAAAC 

SeqID 298

TTGACTAGTAAAGAGGAAGCTAAGAAGCCTTTAAACGAGGGCACAAOT 
TGAGGCTAO^GGTTAGATkAAAATCAAGACaGATaSTAAAAAAGCAGAAGAAGAAGCT 
AAGTTAAAGAAAAACCAGCTGAACAACCaVCAACCAGaKICGGCCCCTa^ 
GTTCCAGCTCCAAAACCAGAGAAGCCAGCTGAACAACCAAAACC^^ 
TGAACAACOVAAAGCAGAAAAIUVCAGATGATCAAC^^

TGACTCMCAGCAACCXKMAAACCAGAGaACCa^^ 

TTCTACAATACTGATGGTTCAATGGCGAOWGGATGGCTCCA^ 

GG<M.CAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTr^^ 

TATCAGATAAATGGTACTATOTCAATGGCTCaWKSTGCCC^^ 

GGTGAATGGGTAAAC 

SeqID 299 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACaTTATTCAATTCGTAAAT^ 

TCHTOTTATGGGAAGTGTGGTTCATGCGACAGAGAAasaUSAG^ 

AACGTAGGAAAGCTOCTGAACMTTCGATGAATATATAAACSUkAATC^ 

GCCTTCaUlCATACaVGTTGAGCAGAATTAAAACGGAOTATTTGJ^^ 

AATAAAAGCAGAGTTAGACXSCaGCTTTTAAGayGTTTAAAAAAGATACAOT 

CTGAGAAQAAGGTTGAAGAAGCTGAGAAGAAGGTAfiCAGAAGCTAAGA;^^ 

AACTACCCAACCATTACTTACAAAACGCTTGACCTTCAAAT^ 

AGTAAAAAAGQAAGCTGACGAATCTCXa^aaAGGGCACAATTAACCAAGC^ 

CTACAAGGTTAAAAAAAATOy^CAGATCGTGAAAAAGCaaAAGAAGAAG 

AGCTTGAACTAGTAAAAGAGGAAGCTAAGGAATCTCGAAACGAGGAAAAAATTAAGCAAGC^^ 
AAAGerGAGGCTACAAGGTTAGAA/^AAATCAAGACyVGATOT 

AGATAAAGTTAAAGAAAAACCAGCTGAACAACCAa^CCAGOSCOGGCrCa^^ 

ATCCAGTTC<^GCTCCAAAACCaVGAGAATCCAGCTGAACyACaU^AAGCa^^ 

TATGCTCGTAGATCAGAAGAAGAATATAATCGCTTGACTCMCAGCAACCGC^^ 

AATAGGCnK3GAAACAAGAAAAa3GTATGTGGTACTTCTAC2UV 

GCTCATGGTACTACCTCAACAGCaU^TGGCGCTATGGCGAaVGGATGGCT 

AATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTGAT^ 

GCTCCaU^TACAATGGCTCATGGTACTACCTCAACGCTAATGGTGATATG^^ 

ACTACCTAAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATACA^ 

ATGGCGACAGGTTGGGTGAAAGATGGAGATACCIX3GTACTATCTT^ 

AGTATCAGATAAATGGTACTATGTCAATGGCTCAGGTGCCCTTGCaiGTCAAaVCAACTC 

ATGGTGAAT66GTAAAC 

SeqID 300 

ATGTTTGCATCAAAAAGa3AAAGAAAAGTACATTATTCAATTa5TAAATCT 

CTTGTTCTTAGGAGGAGTAGTCCATGCAGAAGGGGTTAGAAGTGAGAATACCCCCAAGGTTAC^ 

ATGAATATATAAAAAAAATGTTGAGTGAGATCCAATTAGATAAAAGAAAACATACCCa^ 

AGCAGAATTAAAACGGAGTATTTGTATAAATTAAAAGTTAATGTTTTAGAAGAAAAGTC^^ 

AAAAAAAGAGGTAGACGCAGCTTTTGAGAAGTTTAAAAAAGATACATTGAAACT^ 

AGGTTGAAGAAGCTAAGAAAA7^6CC3^GGATaU^AAAGAAGAAGAT(^ 

GAACnmAATTGCI^GTCaSATGTGAAAGOT 

GGACy^auVTTAACCAAGCMAAGCGAAAGTTAAGAGTGAACAAGCT^ 
AACAAGCTCAGGCTAGAXXXXXXXJOaaCXXXXXXXXXXXXXXXXXK 

AGAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAGGAACCTCGAGACXSAGGAAAAAATTl^ 

AGAGTAAACAAGCnXSAGGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAM 

GCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTGAACM.CaVCAACCAGCGCa;!GCT 

ACCTIGAAAAACCAGCTCCAGCTCCAAAACCAGAGAATCCAGCTGA^ 

AAGAAGACTATGCTCGTAGATCAGAAGAAGAATATAATOSCTTGACTC^ 

CCATCTACTCCAAAAACAGGCTGGAAACAAGAAAAaSGTATGTGGTACTTCT 

GCrCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGC^ 

ACTACCTAAATOCTAATGGTGATATGGCGACAGGATGGCTCCAAAACAATGGCT 

ATGGCGACAGGATGGCTCCAAAACAATGGCTCATGGTACTACCTA 

TGGAGATACCTGGTACTATCnTGAAGCATCAGGTGCTATGAAAGCAAGCC^ 

TCAATGGCTCAGGTOCCCTTGCAGTCAACACAACTGT 

SeqID 301

ATGTTTGCATa^AAAAGOSAAAGAAAAGTACATTATTCaATTCGTA^ 

TCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAAGGAGGTAACTACCCAAGT^^ 

AACATAGQAAAGCTGCTAAACAA6Ta5TCX3ATGAATATATAQAAAAAATGTT^ 

ACCOiAAATTTCGCCTTa^CATGAAGTTGAGCGCAATTAAAAC^^ 

GTTGCCGTa^TCGGAAGCTGAGTTGCCGTa^GAAGTAAAAGCT^AAGTTAGAO^ 

TGAAACTAGGAGAAAAGGTAGCAGAAGCTGAGAAGAAGGTTGCAGAAGCTGAGAAAAAA^ 

CGCaSTAACTACCO^CCATTACTTACmAAACXKn'^^ 

TGAACTATTGAAAGAGGAAGCTAAAACTOaAAACAAGGACAaU^ 

AGGCTACAAAGTTAGAAGAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAXXXXXS^^ 

XXXXmCXXXXXXXATTGCTGAGTCCGATGTGAAAGTTAAAGAAGCGGA 

CGAAACGAGQAAAAAGTTAAGCAAGCMAAGCGAAAGTTGAGAGTAAfl^^ 

AGATCGTAAAAAAGCAGAAGAAGCTAAACX3AAGAGCAGCAGA2U3AAGATAAAGTTAAAG^ 

CAGCGCCGGCTCCTCAACCAGAAAAACCAACTGAAGAGCCTGAGAATCCAGCT 
CAACCAAAAGCAGMAAACCAGCTOATCAACAAGC^^

TCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCC^^ 

ACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCAAAAC^ 

ATGGCGACAGGATGGCTCCAATACAATGGTTCATGGTACTACCTa^ 

CAATGGTTCATGGTACTACCTCAACGCTAATGGTGATATGGCGAC^ 

ACGCTAATGGTGATATGGaSACAGGATGGCTCCAAAAa^TGGCTCAT^ 

GGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGO^TCAOT 

TAAATGGTACTATGTCAATGGCTCAGGTGCCCTTGCM 

GG6TAACC 

SeqID 302



ATGTTTGCATCAJUUUlGaSAAAGAAAAGTACATTATTCAATTCXJTAAATTTA^ 

CTTCTTCITAGGAGGAGTAGTCCATGCAGAAGGGGTTAGAAGTGAGAATATC^ 

ATGAATATATAAAAAAAATGTTGAGTGAGATCCAATTAGATAAAAGAAAACATACCCAO^ 

AGCAGAATTAAAACGGAGTATTTGTATAAATTAAAAGTTAATGTTTTAGAAGftA^ 

AAAAAiUVGAGGTAGACGCAGCTTTTGAGAAGTTTAAAAAAGAT^ 

AGGTTGAAGAAGCTAAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCACCGTAACTACC^ 

GAACTTGAAATTGCrcAGTCCQATGTGAAAGTTAAAGAAGCGGAGC^^ 

GGACACa^TTAACCAAGCAAAAGCGAAAGTTAAGAGTGAACAAGC^ 

AACAAGCTGAGGCTACAAGGTTAGAZ^CATCaUVGACAGATaSTGAAA 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCnTOAl^TTGC^^ 

TAGTAAAAGAGGAAGCTAAGGAATCTCXSAAACGAGGAAAAAGTTAAGCAAGCAAAAGCra^ 

GCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAaCTAAAC^^ 

TAAAGAAAAACCAGCTGAACAACCAOU^CCAGCGCCGGCTCCTCAACC^ 

CAGCTGAACAACCAAAAGCAGAAAAACCAGCI^TCAAC:^ 

CGOTI^CTCAACAGCAACaK:a^AAAACTGAAi^ 

TATGTGGTACm'CTACAATACTGATGGTTCaiATGGCGACAGGATGGCTCC^ 

ATGGTGATATGGOSACAGGATGGCrrCCAAAACAATGGTTCy^T^ 

CTCCAAAACAATGGCTCa^TGGTACTACCTAAACGCTAATGGTGATATGGC^ 

CTACCTAAACXX^TAATGGTGATATGGOSACAGGTTGGGTGAAAGATGGAGAT^ 

TGAAAGCAAGCCAATGGTTCAA^GCATCAGATAAATGGTACTATGTC^ 

GATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 

SeqID 303 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCX5TAAATTO 

TCTTTTTATGGGAAGTGTGGTTCATGCGACAGAGAAGGAGGTAACTACCC^ 

GTCAGACAGAACATATGAAAGCTGCTAAACAAGTCXSATGAATATATAAAAAAl^ 

CAAAATGTCGGCTTACTCACAAASTTGGGaSTAATTAAAACGGAGTATT^ 

AGCTGAGTTGCOTTCAGAAATAAAiUSCAAAGTTAGACGCAGCTTT^^ 

GAAAAAAGGTAGCaJGAAGCTGAGAAGAAGGTTGAAGAAGCTTiAGAAAAAM 

TACCC?^CaU^TACTTACAAAACGCTTGAACTTGACATTGC^^ 

AAAAGGGAGCTACAGGAATCTOlAGAaSAGAAAAAAATTAATCAAGCAAAGCX^^ 

TCAATGGCa^CAGGCI^CTCCAAAACAATGGCTCATGGTACT 

AAACAATGGCrCATGGTACTACCTCAACAGCAATGGCGCTATGGO^ 

TCAACGCTAATGGTOATATGGCGACAGGATGGTTCCAATACAATGGTTm 

AOVGGATGGTTCGAATACAATGGTTGATGGTACTACCTO^CGCTAAT^ 

TTCATGGTACTACCrCAACGCTAATGGTGATATGGCGACAGG^ 

ATGGTGCTATGGTAACAGGATGGCTCCAAAACAATGGCTCy^TGGTACTACCTAAAOT 

GTGAAAGATGGAGATACCTGGTACTATCTTGAAGCaiTCAGGTGCTATGAAAGC^ 

GTACTATGTCAATGGCTCaGGTGCCCTTGCafiTCAAC^ 

AC 

SeqID 304 

MKKKILASLIJ^STVMVSQVAVLTTAHAETTDDKIAAQD^ 

KKIiEGEITELSKNIVSIUTQSLEKQARSAQTNGAOTSyiin'IVNSKSITBAISRVAAMSBIVSANl^^ 
ANKTDAIimnJ^QQKLADDAOAIiTTKQAEIiKAAELSIiAAEK^ 
IiASAimOiTAQVQAVSESAAAPVRAKVRPTYSTNASSYPIGECTOGVOT 
XACWinSGGYGHVAVVTAVESTTRIQVSESNYAGMRTICaiHRGWFN^ 

SeqID 305 

r«CKKIIiASLLLSTVMVSQVAVLTTAHAETTDDK^ 

KKLEGEITELSKNIVSRNQSIiEKQARSAQTNGAVTSYINTIVNSKSITEAISRVAAMSEn^^ 
ANl^AIimriANQQKLADDAQALTTKQAELKAAELSLAAEKATAEGEKASI^ 
I^ANTNLTAQVQAVSESAAAPVRAKVRPTYSTNASSYPIGBCTWGVK^^ 
lACWiroGGYGHVAVVTAVESTTRIQVSESNYAGNRTIGNHRQWEb^^ 

SeqID 306 
MIQIGKlPACSRYRTVKQIGIUSGMADVyiJUmiilliDGEEVAV^^
GQQYIAMEYVAGIiDLKRYIKEHyPIiSNBBauaaMOQIIiI^^ 

TQTNSMMSVHYLSPEQAGGSKATVQSDIYRMSIIFYBMLTGHIPYDGDSAVTIAW 

ATAKKIiTmYRSVSKMYVDLSSSLSYireRiraSiaLIFDFrSKADTKT^ 

PQAPKiOniFKMRyiilliASLVIiVAASLIWIIiSRTPATIAIPDVAGQT^ 

GAGTGRKEGTKIliniiWSSGKQSFQISKYVGRKSSDVIABIJCBiaC^ 

IVLTVAKKVTSVAMPSYIGSSLEFTKNNLIQIVGIKEfiNIBV^ 

TTSATP 

SeqID 307 

MIQIGKIFAGRYRrTOQIGRGGMADVYIAKDLILDGEEVAVKVLRTOT 
GQQYIiAMEYVAGIiDLIOiyiKEHYPLSNEEAVRIMGQIIiLAMRi;^^ 

TQTNSMLGSVHYLSPBQARGSKATVQSDIYAMGIIPYBBgiTGHIPYDGDSAVTIALQHFQKP 

ATAKIOjTNRYRSVSEMYVDIiSSSLSYNRIUireSKLIFDBTSKaDTKm 

PQAPKKHRFKMRYLILIiASLVLVAASLIWIJ^RTPATIAIPiyK&GQT^^ 

GAGTGRKEGTKlOTJWSSGKQSFQISNYVGRKSSDVIAEIiKEKKVPDNLIKIKE^ 

IVLTVAKKATTIQIKSMYIGRNSTEVISEIiKQKKVPEOTiIKI^^ 

VAMPSYIGSSIiEFTKNNIiIQIVGIKEANIEVVEVTTAPAGSVBGMVTO 

SeqID 308 

MIQIGKIPAGRYMVKQIGRGGMADVYIiAKDLIIiDGEEVAVKVLRTNYQ^ 

GQQYIJUyiEYVAGIJJl^YIKKHYPLSNEEAVRIMGQILIiA^^^ 

TQTNSMIKSSVHYLSPEQARGSKATVQSDIYAMGIIFYEMLTGHIPYIKro 

ATAIOCLTNRYRSVSEMYTOLSSSLSYNRRNESKIiIPDBTSKAOTKm 

PQAPKKHRFKMRYLILIiASLVIiVAASLIWILSRTPATIAIPDVAGQr^^ 

GAGTGRKEGTKIlTLWSSGKQSFQISimraRKSSDVIAELKEKKVPDNIjIKIEBBBSl^ 

IVLTVAKKATTIQIiGNYIGRNSTEVISELKQraCVPENnLjIKIEEEBSSESEPGTIMKQSPG^ 

VAMPSYIGSSIiEFTKNOTiIQIVGIKEi^IEVVBVTTAPAGSABGMVVEQSPRAGEKVD 

SeqID 309 

MIQIGKIFAGRYRrVKQIGRGGMADVYIAKDIiIIjDGEEVAVKVIjRT^ 

GQQYIiAMEYVAGIiDIjKRYIKEHYPLSNEEAVRIMGQILLAMRIiAOT 

TQTNSMI^SVHYLSPEQARGSKATVQSDIYANK5IIFYEMIiTGHIPYDGDSAW 

ATAKKIjTNRYRSVSEMYVDLSSSLSYNRRNESKLIFDETSKADTKTLPKVSQSTLTSIPKVQAQTC 

PQAPKKHRFKMRYIiILIiASIjVIjVAASLIWILSRTPATIAIPDVAGQTVAEAKATIi^^ 

GAGTGRKEGTKINXiWSSGKQSPQISNWGRKSSDVIAELKEKKVPDNLIKIEEEESNESEAGTV^ 

iiltvakkattiqiignyigrnstbviselkqkkvpenlikieeebssesepgtimkqspgagtto 
vampsyigssleetknnliqivgikbanievvevttapagsaegmvveqspragekvdi^^ 

SeqID 310 

MIQIGKIFAGRYRIVKQIGRGGMTU^VYIiAKDIilLDGEEVAVKVLRTOT 
GQQYIiAMBYVAGLDLKRYIKEHYPLSNEEAVRIMGQILIiAMRIiAHTRGrVHm 
TQTNSMLGSVHYIiSPEQARGSKATVQSDiyAMGIIFYEPILTGHIPYDGDSAVTIAIiQHFQNPLPS 
ATAKKXjTNRYRSVSEMYVDLSSSIiSYNRRNESKLIFDETSKADTKT^ 

PQAPKKHRFKMRYIjIIjIiASIiVIjVAASIiIWIIiSRTPATIAI PDVAGQTVAEAKATLKKANFEIGBEKTEASBKVBBGRI IRTD: 
GAGTGRKEGTKINIjWSSGKQSFQISNYVGRKSSDVIAEIjKEKKTODNLIKIEEBE 
IVIjTVAKKATTIOLGNYIGRNSTEVISEIiKQKia^ENLIKIEEEESSESEPGTIMKQSPGAGTT^ 
VAMPSYIGSSLEPTKNNIilQIVGIKEAiniEVVEVTTAPAGSVEGMVVEQSP^ 

SeqID 311 

MPASKSERKVHYSIRKFSIGVASVAVASLVMGSVVHATENBGSTQAATSSNMA^^ 
TQNVALNIKLSAIKTKYliREIilT^n^KSm 

ptittyictleleiaefdvkvkeaeiielvkeeakesrnegtik^ 

vatsdqgkpkgrakrgvpgexiatpdbacendakssdssvgeetiipssslksgkkvaeaeka^ 

yktiidleiaesdvkvkeaeleiivkeeakeprdebkikqakakveskkaeatr^^ 

qpqpapatqpekpapkpekpaeqpkaektddqqaebdyarrseeeynrltqqqppktekpaqpstpktgwto 

GSMATGWIiQNNGSWYyiiNANGAMATGWLQNNGSWyyii^^ 

ATGWIiQYNGSWYYXjKO^GDMATGWIiGOTTGSWyYIjlOVNGDMATO 

QWFKVSDKWYYVlKSSGAIAVinrrVDGYGVlUkNG 

SeqID 312 

mpaskserkvhysirkpsvgvasvvvaslvmgsvvhatenegatqvptssnranesq?^ 
kivgesyakstkkrhtxtvalvnelnnikneylnkivestsesqlqii^^ 

skkaeatrijbkiktorkkaeeeakrkaaeedkvkekpabqpqpapapkaekpa^ 

eeeynrltqqqppktekpaqpstpktgwkqengmwyfyot13gsmatgwijqimgswyninsngj^ 

matgwlqmngswyyiinangsmatgwi^yngswyyiilc^ 

SQWFKVSDKWYYVNGSGAIiAVNTTVDGYGVNANGEWVN 
SeqID 313



WYYIlNl^GS^!ATGVm□3GDTWyYIlBASGAMKASQWPKVSDK^ 
SeqID 314

MFMKSERKTOySIMCFSVGVASVAVMLVMGSVVHATENBRTTO 
APNIQIiSRIKTEYIiNGIJCEKSEAELPSKIKABIjDAAPKQPKKD^ 
NYPTITYKTIiDLEXABPDVKVKBAELKLVKKEADBSM^ 



PADQQAEEDYARRSBBBYNRLTQQQPPKPEQPAPAPKIGWKQENGMWYFYOTTCSMAT^^ 

NGSWYYLNANGDMATGWLQYNGSWYYIiNANGDMATGWLQYl^ 

WYYLEASGAMKASQWFKVSDKWYYVNGSGMiAVim^GYGVI^^ 

SeqID 315 

MPASKSERKVHYSIRKFSIGVASVAVASLFTjGGVVHAEGVRSENTPKVTSSGD^^ 

SRIKTEYIiYKIiKVNVIiEEKSKAELTSKllCKBVDAAPBKPK^^ 

ELEIAESDVKVKEAKI^LIiKEEAKTRNEDTINQAKArVKS 

AEEBAKRKAAEBDKVKEKPAEQPQPAPAPQPEKPTPKPEKPAPAPKPBNPAEQPKM 

PPKTEKPAQPSTPKTGi^KQBNGMWYFYNTDGS^!A^TOWIlQYlraSWYyIlNA^^ 

WYYIJiQMTGDMATGWIiQNNGSWYYIiKANGDMATGWV^ 

AN6EWVN 

SeqID 316 

MFASKSERKVHYSIRKPSIGVASVAVASLVWKMVVHATEKBV^^ 

TQNFAFNMKIiSAIKTErSXYGLKEKSEAEXiPSSBAELPSEVKiiXIiD^^ 

RIOryPTITYKTLDIiBIABSDVEVKKABIiBIiiaaS 

ATRIiE£CIKTDRKJCAEEAKRRAA££DKVK£ia?i^QPQPAPAPQPEKPT^ 
SEEEYlWUJTQQQPPKTEKPAQPSTPKTOWKQENCaawyPYOTBGSMATG 

ASQWFKVSDKfimrVN6SGAIAVXn?TVDGYGVN2^ 

SeqID 317 

MFASKSERKVHYSIRKFSIGVASVAVASLFLGGVVHAEGVRSENTPI^ 

SRIKTEyiiYKLRVNVX>EEKSKAELTSKTKKEVDAAFEKFR^^ 

ELEXABSDVKViaBAEIiEIiIiIISEAKTRIlEDTXl^^ 

AKVESKQABATRIiEKIKTDRKKAEEEiUCRKAAEEDKVKEKPA^ 
AIU^BEEYXQUjTQQQPPKTEKPAQPSTPKTGWKC^GMWYFYNTDGSimTGff^ 
ANGDMATGVniKSNNGSWyyitNANGDMATGW^ 
GAIiAVNTTVDGYGVMANGEWVH 

SeqID 318 

MFASKSERECVHYSIRKFSIGVASVAVASIiFMGSVVHATEKEVT^ 

QNVGLIiTKIiGVIKTEYtaGIiSVSKKKSEAELPSEXKAK^^ 

YPTNTXKTLEIiDIAESDVBVin^aETiBLVKGSYRmjKTJtUi^^ 

QYNGSWYYI^NANGDMATGHFQYMGSWYYIiNANGDMATGWFQyNGSWYyi^^ 

GSWYYZ^SNGAnnrrGVinbQKNGSWyyiiMaNGSMATDWVia^^ 

VNANGEWVN 



Note: "X" represents undefined/missing nucleotides or amino acids due to unavailable sequencing information.



